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Preface to the volume Languages: Bioinspired 
Approaches 


Gemma Bel-Enguix 


Languages, whether they be natural or artificial, are particular cases of a 
symbol system. And the manipulation of symbols is the stem of formal 
language theory. The theory of formal languages mainly originated from 
mathematics and generative linguistics. It was born in the middle of the 
20th century as a tool for modelling and investigating the syntax of natural 
languages. After 1964, it developed as a separate branch with specific prob- 
lems, techniques and results and since then it has had an important role in 
the field of computer science. Formal language theory, due to its abstract 
and formal properties, has been applied to a wide range of fields (besides 
initial linguistic motivation): economic modelling, developmental biology, 
cryptography, sociology... Therefore, natural languages, computer science 
and formal languages had a mutual influence over the years. 


During the 20th century, biology became a pilot science, and many dis- 
ciplines have formulated their theories using models taken from nature. 
Bio-informatics, bio-engineering and bio-linguistics are good examples of 
this pre-eminence of ‘bio’-sciences. Computer science has almost become 
another bio-inspired field thanks to the considerable development of natu- 
ral computing and DNA computing. 


Following this tradition of interaction between biology and computer 
science, the articles in this volume present bioinspired approaches to formal 
language therory. Artion Alhazov and his co-author Tseren-Onolt Ishdorj 


XIV Gemma Bel-Enguix 


develop some aspects of Membrane Computing in the first two papers of 
the journal. Membranes are also the central topic of Mihai Ionescu’s contri- 
bution, that introduces P Systems with promoted /inhibited rules and traces. 
Adria-Horia Dediu presents some applications of evolutionary algorithms 
to formal languages. Maria Dolores Jiménez-Lépez applies one of the main 
processes in DNA, recombination, to formalize some patterns in syntax of 
natural languages. The paper by Remco Loos and Benedek Nagy explains 
some concepts of parallelism in DNA computing. Finally, the contribution 
by K.G. Subramanian, Mary Roslin and P. Helen Chandra is focused in the 
description of splicing for bidimensional languages. 


The work that is published in this volume has been been previously 
presented and discussed in the weekly seminars that the GRLMC organized 
during several years. The authors have improved and updated articles that 
provide new ideas, tools and formalisms which can help to the description, 
analyis and processing of natural and artificial languages. 


I am deeply grateful to all the contributors, for their professional and 


friendly cooperation, and to the Universitat Rovira i Virgili, for supporting 
the publication of this issue. 
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Minimal Parallelism and Number of Membrane 
Polarizations* 


Artiom Alhazov! 


Institute of Mathematics and Computer Science 
Academy of Sciences of Moldova 

Chisinau, MD-2028, Moldova 

E-mail: artiom@math.md 


Summary. It is known that the satisfiability problem (SAT) can be efficiently solved 
by a uniform family of P systems with active membranes that have two polarizations 
working in a maximally parallel way. We study P systems with active membranes 
without non-elementary membrane division, working in minimally parallel way. 
The main question we address is what number of polarizations is sufficient for an 
efficient computation depending on the types of rules used. 

In particular, we show that it is enough to have four polarizations, sequential 
evolution rules changing polarizations, polarizationless non-elementary membrane 
division rules and polarizationless rules of sending an object out. The same problem 
is solved with the standard evolution rules, rules of sending an object out and po- 
larizationless non-elementary membrane division rules, with six polarizations. It is 
an open question whether these numbers are optimal. 


* The author acknowledges project 4032 from the Science and Technology Center 
in Ukraine and project TIC2003-09319-C03-01 from the Rovira i Virgili University. 
The preliminary version of this paper was published in the pre-proceedings of the 
Workshop in Membrane Computing, Leiden, 2006 and presented at the seminar 
of the Research Group on Mathematical Linguistics, Tarragona. 


2 A. Alhazov 


1 Introduction 


Membrane computing with symbol-objects is a biologically inspired frame- 
work of distributed parallel multiset processing; see [10] for an overwiew 
and [13] for the comprehensive bibliography. The most addressed questions 
are completeness (solving every solvable problem) and efficiency (solving 
hard problems in a feasible time). We focus on the latter. 

An interesting class of membrane systems are those with active mem- 
branes (see [9]), where membrane division can be used for solving compu- 
tationally hard problems in polynomial time. Let us mention a few results: 


e A semi-uniform solution to SAT using three polarizations and division 

for non-elementary membranes, [9]. 

A polarizationless solution, [2]. 

Only division for elementary membranes, with three polarizations [11]. 

A uniform solution, with elementary membrane division, [12]. 

Only two polarizations, in a uniform way, with elementary membrane 

division [3]. 

e Computational completeness of P systems with three polarizations and 
three membranes [10]. 

e Only two polarizations and two membranes [6]. 

e Only one membrane, with two polarizations [4, 5]. 

e Polarizationless systems are complete, with no known bound on the num- 
ber of membranes [1]. 

e Solving SAT in a minimally parallel way, using non-elementary mem- 
brane division (replicating both objects and inner membranes) [7]. 

e Polarizations avoided by using rules that change membrane labels. To the 
best of the author’s knowledge the rules are either cooperative or non- 
elementary division as above [8]. 


Given a P system, a rule and an object, whether the rule is applica- 
ble to the object in some membrane might depend on both the membrane 
label (that usually cannot be changed) and the membrane polarization. Es- 
sentially, the number of polarizations is the number of states that can be 
encoded directly on the membrane. 

Minimal parallelism provides less synchronization between the objects, 
so one might expect the need for stronger control (i.e., more polarizations). 
It is not difficult to construct the system so that the rules are global (i.e., the 
membrane labels are not distinguished), probably by not adding additional 
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polarizations. In this way, the results dealing with the number of polariza- 
tions can be reformulated in terms of number of membrane labels (in which 
case the systems have no polarizations, but the rules are allowed to modify 
membrane labels). 


2 Preliminaries 


2.1 Solvability by P systems with input 


Definition 1. A P system with input is a tuple (I1,&, ir), where (a) II is a P 
system with working alphabet , with m membranes labelled with 1,--- ,m, and 
initial multisets w1,--- ,Wm (over O — X) associated with them; (b) X C O is an 
(input) alphabet, (c) iy is the label of a distinguished (input) membrane. 


The initial configuration of (II, x, ity) with an input multiset w over & is 
(1, W1,° te Wir, U Wr? ,Wm). 


We call (II, Z, itz) a decisional P system with input if there exist two distin- 
guished objects yes,no € O and for any valid input (see cod function in the 
definition below) all its computations send to the environment exactly one 
object: either yes (in this case the computation is called an accepting one) 
or no. Moreover, (II, %, iz) is called confluent if for any valid input all its 
computations halt in the same configuration. 


Definition 2. Consider a decision problem X = (Ix,0x): Ix is the set of possible 
instances of X and @ x is a Boolean function over Ixy. We say that X is solvable in 
polynomial time by a uniform family of P systems II = (11(n)) nen if the following 
conditions hold: 


e The family I is polynomially constructible: i.e., there exists a deterministic 
Turing machine constructing the system I1(n) from n in polynomial time. 

e There exists a pair (s,cod) of polynomial-time computable functions mapping 
every instance u € Ix of the problem X into a natural number and a multiset 
(over the alphabet of I1(s(u))), respectively. The instance u is to be solved by 
a system I1(s(u)) with the multiset cod(u) placed in the input membrane, as 
described below. 

e The family II is polynomially bounded with respect to (X,cod,s): i.e., there 
exists a polynomial function p(n) such that for each u € Ix every computation 
of the system II(s(u)) with input cod(u) halts in at most p(s(u)) steps. 
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e The family II is sound with respect to (X,cod,s): i.e., for each u € Ix if there 
exists an accepting computation of I1(s(u)) with input cod(u), then @x(u) = 
1. 

e The family II is complete with respect to (X,cod,s): i.e, for each u € Ix 
if 0x(u) = 1, then every computation of II(s(u)) with input cod(u) is an 
accepting one. 


2.2 P systems with active membranes 


Definition 3. A P system with active membranes is a P system with the working 
alphabet O,the set H of membrane labels, the set E of polarizations, and the rules of 
the following forms: 


(a)[a],u ]), fora € O,u € O*,h € Hande € E. These are object evolution 
rules. An object a € O in the region associated with a membrane with label h 
and polarization e evolves to a multiset u € O*. 

(b) al], > [b iA fora,b € O,h € Hand e,e’ € E. These are send-in communi- 
cation rules. An object a from the region immediately outside a membrane with 
label h and polarization e is introduced in to this membrane, transformed into b 
and the polarization of the membrane is changed to e’. 


(c)[ a], > [ [fb fora,b € O,h € Hand ee’ € E. These are send-out 
communication rules. An object a is sent out from the region associated with 
a membrane with label h and polarization e to the region immediately outside, 
transformed into b and the polarization of the membrane is changed to e’. 

(d)[a];, > 4 fora,b © O,h € Hande © E. These are dissolution rules. A 
membrane with label h and polarization e is dissolved in reaction with an object 
a, and transformed into b. The skin is never dissolved. 

(e)[a], >[b Al c lf for a,b,c € O,h € Hand e,e’,e" € E. These are divi- 
sion rules for elementary membranes. An elementary membrane can be divided 
into two membranes with the same label, possibly with different polarizations, 
possibly transforming some objects. 


Generally, rules of type (a) are executed in parallel, while at most one rule 
out of all rules of types (b), (c), (d), (e) can be applied to the same membrane 
in the same step. We will also speak about the sequential version 


(a’) [a]£ +[u]% fora€O,u€ O*,h € Hande,e EE. 


of rules (a) (let us use ” to indicate that the rule is allowed to change the 
polarization of the membrane) and their modifications (bo), (co), (do), (eo), 
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(495), (b5), (co), (@9) (here, 0 represents that the rules neither distinguish po- 
larization nor change it, while ‘ means that the rule is allowed to change 
membrane label). 


2.3 Minimal parallelism 


These rules are applied according to the following principles: 


e The rules of type (a) may be applied in parallel. In one step, a membrane 
can be the subject of only one rule of types (a9,), (a7) and (b), (c), (d), (e) 
with their modifications. 

e Inone step, one object of a membrane can be used by only one rule (non- 
deterministically chosen), but for every membrane at least one object 
that can evolve by one rule of any form, must evolve (no rules associated 
to a membrane are applied only if none are applicable). 

e If at the same time a membrane is divided by a rule of type (e) and there 
are objects in this membrane which evolve by means of rules of type (a), 
then we assume that the evolution rules of type (a) are used first, and 
then the division is produced. Of course, this process takes only one 
step. 


3 Using rules (a?) 


The three size parameters of the SAT problem are the number m of clauses, 
the number n of variables and the total number ! of occurrences of variables 
in clauses (clearly, 1 < mn: without restricting generality, we could assume 
that no variable appears in the same clause more than once, with or without 
negation). 


Theorem 1. A uniform family of confluent P systems with rules (al), (co), (eo) 
working in minimally parallel way can solve SAT with four polarizations in 
O(l(m + n)) number of steps. 


Proof. The main idea of the construction is to implement a maximally paral- 
lel step sequentially. To do so, a “control” object will change the polarization, 
and then an input object or a clause object will restore it. Since the input 
is encoded in | objects, changing and restoring polarization will happen ! 
times, and the counting is done by the “control” object. 

Let us consider a propositional formula in the conjunctive normal form: 
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B= QV-°:VCn, 
Ci =Yir AN AYily 1<i<m, where 
Yee ae | Spon le re mS rsd, 


m 
l=]. 
i=1 


Let us encode the instance of f in the alphabet &((n,m,1)) by multisets 
X, X’ of the clause-variable pairs such that the variable appears in the clause 
without negation, with negation or neither: 


X((n,m,1)) = {0ji1s 
X = {(i11/1) | xi € {yx 
L<jem 1a 
X! = {(0j412,1) | mx € {yn | 1 Sk <j}, 
Le pom Leta ay. 


1<j<m,1<i<9,1<5 <2}, 
1<k<ij}, 


We construct the following P system: 


T1((n,m,1)) = (O,H,E,[[ ]3[ ]3]}, wi, we, w3,R), with 
O= {ojj5xs|1< jam, 1<i<n, 
1<k<m+n+1,1<s< 4} 
U {diz |1Sicm+nt1,1<k< 2} 
U {tin fix |1<isn1<k<t 
U {dj |1<i<m+n+1}U {S,Z,yes,no} 
U {ze |1<k < (414+3)n4+m(4l + 1) +2} 
= A, W = dh, we = 2, H = {1,23}, E =40,1,2,3}, 


and the rules are listed below. The computation consists of three stages. 


1. Producing 2” membranes with label 2, corresponding to the possible as- 
signments of variables x1,--- ,Xn and selecting clauses that are satisfied 
for every assignment (groups A and C of rules). 

2. Checking whether all clauses are satisfied for all assignments (groups of 
rules B and D). 

3. Generating yes from the positive answer, and sending it to the environ- 
ment. Generating no from the timeout (during the first two stages the 
number of steps is counted in the object in membrane with label 3) and 
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sending it to the environment if there is no positive answer (groups of 
rules E and F). 


Stage 1 consists of n cycles and stage 2 consists of m cycles. Each cycle’s aim 
is to process all / objects: i.e., each object counts the number of cycles com- 
pleted, and in the first stage the clauses are evaluated while in the second 
stage the presence of each clause is checked. 

In the case of maximal parallelism, a cycle can be performed in a con- 
stant number of (actually, one or two) steps, while minimal parallelism can- 
not guarantee that all objects are processed. The solution used here is the 
following. A cycle consists of marking (setting the last index to 3 or 4) all ! 
objects one by one while performing the necessary operation, and then un- 
marking (setting the last index to 1 or 2) all of them. Marking or unmarking 
an object happens in two steps: the control object changes the polarization 
from 0 to 1, 2 (to mark) or 3 (to unmark), and then one of the objects that 
has not yet been (un)marked is processed, resetting the polarization to 0. 


Control objects in membrane 2: select clauses 
Al (for variable i: divide) 
d;] > |tioll fio],1<icn 
A2 (process and mark all ! objects) 
feng | [hy | StS, 1 Sk <1 
faa! >lfel Stan iskst 
A3 (prepare to unmark objects) 
tin ]° > [dig], 1<i<n 
fil = dg ll ste 
A4 (unmark all / objects) 
dpa || de  IStReIeeSd 
A5 (switch to the next variable) 
din |” > [dizi |) 1Sism 


Control objects in membrane 2: check clauses 
B1 (test if clause i is satisfied) 
[dnsi]° + [dntia 7, 1<ism 
B2 (process and mark the other / — 1 objects) 
[dnsie—1]° > [deie |, 1Sism1<k<! 
B3 (unmark all / objects) 
(Corer ee a )° [ansisre P1<ism1<k<i 
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B4 (switch to the next elause) 

[ dn+i2r ig a [ dntit4 ] ,l<i<m 
B5 (send a positive answer) 

[4intnt1 =e ]s 


Input objects in membrane 2: select clauses 

C1 (mark an object) 
[vies 1” > [ P2e41542 |”, 
l<i<m1l<j<nil<k<mkAmi1i<s<21<p<2 

C2 (a true variable present without negation or a false variable present with 
negation satisfies the clause) 

Vjsi,i,8 lb => [ % 4413 1°, 1<i<mi1< j <n,l<s<2 

C3 (a true variable present with negation or a false variable present without 

negation does not satisfy the clause) 

Vii,i3—s i —> [ 21414 1°, 1<i<mi1< J <n,l<s<2 

C4 (unmark an object) 

Oj i,k s+2 i > [ Vjii,k,s ie 

1<i<m1<j<n,2<k<m4+1,1<s<2 


Input objects in membrane 2: check clauses 
D1 (check if the clause is satisfied by at least one variable) 
Vjim+jal? > [%iev1a]?, 1 <i<m,1<j<nis<s<2 
D2 (mark an object) 
Vjin+k,s (i + [ Op iks1s42 Ie 
1l<i<m1l<j<nl<k<n1l<s<2 
D3 (unmark an object) 


3 0 
Viimtks+2 |” > [245] 
l<i<m1<j<n2<k<n4+1,1<s<2 


Control objects in membrane 3 
E1 (count) 

[zr-1]° 9 [ze | 1<k< N= (414+3)n + m(4l +1) +2 
E2 (send time-out object) 

[zw] [12 


Control objects in the skin membrane 
F1 (a positive result generates the answer) 
[$]°— [yes]! 
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F2 (without the positive answer, the time-out generates the negative answer) 
[Z |° > [no ]° 
F3 (send the answer) 


[yes] >| lyes 
[no] + [ ]no 


Let us now explain how the system works in greater detail. 

Like the input objects, the control objects keep track of the number of cy- 
cles completed. The control object also remembers whether marking or un- 
marking takes place, as well as the number of objects already (un)marked. 
Moreover, the control object is responsible for sending the “right” informa- 
tion to the objects via polarization: in stage 1, 1 if the variable is true, and 2 
if the variable is false; in stage 2, 1 if the clause has already been found, and 
2 if the clause is being checked for. 

During the first stage, an object 7; ;,1,; is transformed into 0;,;n41,1, where 
t = 1 if variable x; satisfies clause C;, and t = 2 if not. The last index 
changes from s to t when the third index is equal to i. Notice that, although 
only information about what clauses are satisfied seems to be necessary 
for checking if 6 is true for the given assignment of the variables, such 
information as the number of cycles completed is kept for synchronization 
purposes, and the other objects are kept so that their total number remains !. 
The control object d; is transformed into d,+1. Stage 1 takes (4/ + 3)n steps. 

If some clause is not satisfied, then the computation in the correspond- 
ing membrane is “stuck" with polarization 2. Otherwise, during the second 
stage an object 0; ;n41,4 is transformed into 0} jn4m+1,t, While the control ob- 
ject d, +1 becomes dj,441. Stage 2 takes m(4l + 1) steps, plus one extra step 
to send objects S to skin, if any. 

After stage 2 is completed, one copy of 5, if any, is transformed into yes, 
changing the polarization of the skin membrane. If a yes is sent out, at the 
same time object Z comes to the skin from region 3. If the polarization of 
the skin is still 0, Z changes to no, and is then sent out. Depending on the 
answer, stage 3 takes 2 or 4 steps. In either case, the result is sent out in the 
last step of the computation. 


Notice that membrane labels are not indicated in the rules. This means 
that the system is organized in such a way that the rules are global: ie., 
the system would work equally well if it started with the configuration 
w= [wy[ we ii W3 li le The labels are only given for the simplicity of 
explanation. 
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Using the remark at the end of the Introduction, we can obtain the fol- 
lowing 


Corollary 1. A uniform family of confluent polarizationless P systems with rules 
(455), (Co), (eo) working in minimally parallel way can solve SAT with membrane 
labels of four kinds. 


The statement follows directly from the possibility of rewriting a global rule 
[a]° > [u ld of type (a/) asarule [a], > [u],, of type (a),) (which is 
polarizationless but can change the membrane label). 


4 Using rules (a) 


An informal idea in this section is to replace rules of type (a) with rules 
(a) producing additional objects, and rules (c), sending an additional object 
out to change the polarization. 


Theorem 2. A uniform family of confluent P systems with rules (a),(c), (eo) 
working in minimally parallel way can solve SAT with six polarizations in O(1(m + 
n)) number of steps. 


Proof. The strategy used in the construction below is similar to that of the 
previous theorem. However, since the application of the evolution rules no 
longer changes the polarization of the membrane, the control symbols d; ¢, 
tit, fik no longer “operate” in polarization 0, but rather in a polarization 
that toggles between 0 (for even k) and 5 (for odd k), to prevent multiple 
applications of evolution rules in one row in the same membrane. Moreover, 
the input objects are actually allowed to evolve in parallel (and the degree 
of parallelism is chosen non-deterministically), but at the end of both halves 
of a cycle the number of extra objects produced can be counted, to make 
sure that all / objects have been processed. 
For the same propositional formula 


B=QV-::VCn, 
Ci =Yin N+ AYily 1<i<m, where 
V2 EG oe | Iason lets mere, 


m 
l=]. 
i=1 
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and the same encoding of the instance of 6 in the alphabet X&((n,m,1)) by 
multisets X, X’, 
Ein; ))) = [opie TS jam, 1 <t<n, le a2), 
X= {(% i111) |i € {yjx | 1 Sk < Tj}, 
l<j<m 1<i<n}, 
X= {(0j412/1) | x; € {yjx | 1 Sk SU}, 
SG Se ee ce ae 


we construct the following P system: 


T1((n,m,1)) = (O,H,E,[[ ]3[ 13 }}, 1,2, w3,R), with 
O={er (Lei sm latan 
1<k<m+n+4+1,1<s<4} 
U {dix |1<icm+n+1,1<k <2} 
Ute ie Iaien tere} 
U {dj |1<i<m+n+1}U{S,Z,yes,no} 
U {zp |1<k < (414+3)n + m(4l +1) +2} 
U {o,;|0<i<5,0<j <5} 
wy, =A, WwW, = dy, W3 = 2, H = {1,2,3}, E = {0,1,2,3,4,5}, 


and the rules are listed below. The computation stages are the same as in 
the previous proof. 


1. Producing 2” membranes for all the possible assignments of variables; 
selecting satisfied clauses (groups A and C). 

2. Checking whether all clauses are satisfied (groups B and D). 

3. Generating the answer and sending it to the environment (groups E and 
F). 


Stage 1 consists of n cycles and stage 2 consists of m cycles. Each cycle’s 
aim is to process all | objects: i.e., each object counts the number of cycles 
completed. In the first stage the clauses are evaluated while in the second 
stage the presence of each clause is checked. 

A cycle consists of marking (setting the last index to 3 or 4) all ! objects 
one by one while performing the necessary operation, and then unmarking 
(setting the last index to 1 or 2) them. An object is generally marked or 
unmarked in five steps: 
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1. The control object produces two “polarization changers". 

2. One of them changes the polarization from 0 or 5 to 1, 2 (to mark) or 3 
(to unmark). 

3. One of the objects that has not yet been (un)marked is processed, pro- 
ducing a “witness" — yet another “polarization changer". 

4. The “witness” switches the polarization to 4. 

5. The second “changer” produced in step 1 of this routine changes the 
polarization to 5 or 0. 


Notice, however, that “step" 3 might actually take more than one step (more 
objects can be (un)marked in parallel, or even in a row, thus creating a 
supply of “witnesses"). Step 4 might actually be executed in parallel with 
the last step of “step” 3 (sending out a previous “witness” while producing 
more). Finally, “step" 3 might even be skipped if a previous “witness” is 
already there. What matters is that the whole (un)marking routine should 
take at most 5/ steps. 


Changing the polarization of membrane 2 

Ol (change from i to /) 
[o;;]' + [ lop 0s71=5,05j545 

O2 (“witnesses" of D2 are “compatible” with “witnesses” of D1; this does 
not interfere with the rest of the computation) 


[ora]? > [ ]* 045 


Control objects in membrane 2: select clauses 
Al (for variable i: divide) 
di] > [toll fio]l,l<isn 
A2 (process and mark all ! objects) 
tip—1 — tix001045 ]°,1<i<n,1<k<I,kis odd 
fik—-1 > fix00,2045 i 1<i<n,1<k<l1,kis odd 
tin-1 — tix05,1049 |?,1<i<n,1<k<1I,kis even 
fik-1 > Fi,k05,204,0 ie 1<i<n,1<k<l1kiseven 
A3 (prepare to unmark objects) 
tip > dip ie 1<i<n,iflis even 
fil > di ie 1<i<n,iflis even 
tit + digoso |°, 1 <i <1, if lis odd 
fit > di050 |?, 1 <i <n, if lis odd 
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A4 (unmark all / objects) 
[ dip-1 > dip003045 ]°, 1 <i<n,1<k<I,kisodd 
[ dix—-1 > 4;,05,3040 }2.1<i<n,1<k<1,kis even 
A5 (switch to the next variable) 
[ dit > diya |, 1 <i <n, if lis even 
[ dit + di41050 |, 1 <i <n, iflis odd 


Control objects in membrane 2: check clauses 
B1 (test if clause i is satisfied) 
dni > dn4i 1002045 ]°, 1 <i<m 
B2 (process and mark the other / — 1 objects) 
An4+ik—1 > An +i 00,1045 I 1<i<m1<k<I,kis odd 
An+ik—1 > Ay +i405,104,0 i 1<i<m1<k<lI,kis even 
B3 (unmark all / objects) 
An+il+k—1 > Ay +i14+k00,304,5 lie? 1<i<m1<k<I1,1+kis odd 
An+il+k—1 > An +i14+k05,304,0 L; 1<i<m1<k<I1,1+kis odd 
B4 (switch to the next clause) 
An+i2l > An sist i 1l<i<m 
B5 (send a positive answer) 
dm+nti |” + [ ]°S 


Input objects in membrane 2: select clauses 
C1 (mark an object) 


[ Miks > %ie+15+20p4 |”, 
l<i<m1l<j<nil<k<mkAm1<s<2,1<p<2 


13 


C2 (a true variable present without negation or a false variable present with 


negation satisfies the clause) 
Vjiis —_ Vj ii+1,398A4 F. 1<i< m, 1< J <n, 1<s<2 


C3 (a true variable present with negation or a false variable present without 


negation does not satisfy the clause) 
Uiiis—s > Yiit140ea], 1 <Sism,1<j<ni<s<2 
C4 (unmark an object) 


3 
Ojiks+2 7 0} iks03A | , 
l<i<m1<j<n2<k<m+11<s<2 


Input objects in membrane 2: check clauses 
D1 (check if the clause is satisfied by at least one variable) 
[Viana > Gaara | 1 <i Sm 1<j<n,1<s <2 
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D2 (mark an object) 


1 
[ Oj imtks — 0j,ik+1s+2014 l, 
1l<i<m1l<j<nl<k<n1l<s<2 
D3 (unmark an object) 


3 
[ Mjim+ks+2 > 71,503 |”, 
l<i<m1<j<n2<k<n4+1,1<s<2 


Control objects in membrane 3 
E1 (count) 

[Ze-1 9 2 |, 1 <k < N= (101 4+-5)n + m(101 +1) +2 
E2 (send time-out object) 

ie O01 


Control objects in the skin membrane 

F1 (the first positive result sends the answer) 
[S]° > [ ]'yes 

F2 (without the positive result, the time-out sends the negative answer) 
[Z]° + [ ]®no 


Let us now explain how the system works in greater detail. The control 
objects keep track of the number of cycles completed, whether marking or 
unmarking takes place, as well as the number of objects already (un)marked. 
Moreover, the control object is responsible for sending the “right” informa- 
tion to the objects via polarization: in stage 1, by generating 091 or 05, if the 
variable is true, and 099 or 05 if the variable is false; in stage 2, 09 or 051 
if the clause has already been found, and 09, or 05 if the clause is being 
checked for. 

During the first stage, an object 0; 1,5 is transformed into 0; ;n+1,4, where 
t = 1 if variable x; satisfies clause C;, or t = 2 if not. The last index changes 
from s to tf happens when the third index is equal to i. The control object 
d, is transformed into d,,,1. Stage 1 takes at most (10/ +5)n steps (at most 
(10! + 3)n in the case when ! is even). 

If some clause is not satisfied, then the computation in the correspond- 
ing membrane is “stuck” with polarization 2. Otherwise, during the second 
stage an object 0; 41, is transformed into 0; jn+m-+1,, While the control ob- 
ject dy41 becomes dyy441. Stage 2 takes at most m(10/ + 1) steps, plus one 
extra step to send objects S to skin, if any. 

After stage 2 is completed, one copy of S, if any, is sent out as yes, 
changing the polarization of the skin membrane. After this time has passed, 
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object Z comes to the skin from region 3. If the polarization of the skin is 
still 0, Z is sent out as no. 


The rules of the system in the proof above are also global, so we can again 
obtain the following 


Corollary 2. A uniform family of confluent polarizationless P systems with rules 
(a), (co), (€0) working in minimally parallel way can solve SAT with membrane 
labels of six kinds. 


5 Conclusions 


Changing membrane polarization controls what rules can be applied, so the 
number of polarizations corresponds to the number of states of this con- 
trol. Moreover, almost the only way the objects of the system may interact 
is by changing the membrane polarization. Therefore, the number of polar- 
izations is a complexity measure that deserves our attention. 

For maximal parallelism it has been proved that two polarizations are 
sufficient for both universality (with one membrane) and efficiency, while 
one-polarization systems are still universal (with elementary membrane di- 
vision and membrane dissolution), but are conjectured not to be efficient. 

We have proved that efficient solutions of computationally hard prob- 
lems by P systems with active membranes working in a minimally parallel 
way can be constructed avoiding both cooperative rules and non-elementary 
membrane division, thus improving results from [7],[8]. For this task, it is 
enough to have four polarizations, sequential evolution rules changing po- 
larizations, polarizationless elementary membrane division rules and po- 
larizationless rules for sending an object out. The standard evolution and 
send-out rules, as well as polarizationless elementary membrane division 
rules, can be used; in this case, six polarizations suffice. 

The first construction is “almost” deterministic: the only choices the sys- 
tem can make in each cycle is the order in which the input systems are pro- 
cessed. The second construction exhibits a more asynchronous behaviour 
of the input objects, which, depending on the chosen degree of parallelism, 
might speed up the positive answer, but by less than 20%. 7 In this case, 


2 The maximal total number of steps needed is slightly over 10/(m + mn); the fastest 
computation happens if rules C2 are executed in parallel for all input objects, as 
well as rules C4, D2, D3, saving lm — 1, lm — 1, In — 1, In — 1 steps, respectively. 
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controlling polarizations by evolution is still faster than controlling polar- 
izations by communication. 

A number of interesting problems related to minimal parallelism re- 
main open. For instance, is it possible to decrease the number of polariza- 
tions/labels? Other computational problems in the minimally-parallel set- 
ting can also be studied: for instance, the computational power of P systems 
with one active membrane working in the minimally parallel way. 
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investigate a particular combination of types of rules that can be used in solving the 
SAT problem in linear time. 


1 Introduction 


Such operations as membrane fusion (merging), membrane fission (bud- 
ding, separation), and release of vesicle contents are well known phenom- 
ena in cell biology. Many macromolecules are too large to be transported 
across membranes through protein channels, which is why they are trans- 
ported by means of vesicle formation. This process can transport packages 
of chemicals into or out of the cell, the content of the vesicle is released, and 
the vesicle fuses with the cell membrane. 

Informally speaking, in P systems with active membranes without polar- 
izations six types of rules are used: (a9) multiset rewriting rules, (bo) rules 
for introducing objects into membranes, (co) rules for sending objects out 
of membranes, (dg) rules for dissolving membranes, (eg) rules for dividing 
elementary membranes, and (fy) rules for dividing non-elementary mem- 
branes (see [1]). In these rules, a single object takes part in the process. We 
introduce here some further types of rules: (go) membrane merging rules, 
(ho) membrane separation rules, and (ig) membrane release rules, all in 
the framework of P systems with active membranes. The common feature 
of these rules is the transport of multisets of objects among regions of the 
system. 

Some operations with membranes, other than dissolution and division 
which are considered to be basic in membrane computing, were also intro- 
duced in [3] and [2]. 

In P systems, exponential workspace is obtained by dividing mem- 
branes, creating membrane, and replicating strings. We will see an interest- 
ing new way for obtaining exponential workspace in linear time, by using 
membrane separation. Some particular combinations of types of rules in 
P systems with active membranes can solve hard problems, typically NP- 
complete problems, in linear time. This possibility is illustrated here with 
the SAT problem. 
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2 Operations on membranes without polarization 


The reader is assumed to be familiar with the fundamentals of membrane 
computing, e.g., from [4]; details can be also found at http: //ppage. psystems.eu/. 


We are considering a P system defined as II = (O,H, pt,w1,...,Wm,R), 
where 


1. m > 1 is the initial degree of the system; 
2. O is the alphabet of objects; 
3. H is a finite set of labels for membranes; 
4. is a membrane structure, consisting of m membranes, labeled (not nec- 
essarily in a one-to-one manner) with elements of H; 
5. W1,...,Wm are strings over O, describing the multisets of objects placed in 
the m regions of 4; 
6. Ris a finite set of developmental rules, of the following forms: 
(a9) [a> 0 ],, forh € H,a € O,v € O* 
(object evolution rules, associated with membranes and depending 
on the label, but not directly involving the membranes, in the sense 
that the membranes are neither taking part in the application of 
these rules nor are they modified by them); 
(bo) al ], 2 [6],,forh € H,a,b€O 
(communication rules; an object is introduced in the membrane dur- 
ing this process); 
(co) [a ], > [ ],b,forhe€ H,a,beO 
(communication rules; an objects sent out of the membrane during 
this process); 
(do) [@ |, 2 5, forh € H,a,b €O 
(dissolving rules; in reaction with an object, a membrane can be 
dissolved, while the object specified in the rule can be modified); 
(eo) [4], ~[b6],[¢],, for hk € H,a,b,c€O 
(division rules for elementary membranes; in reaction with an ob- 
ject, the membrane is divided into two membranes with the same 
label; the object specified in the rule is replaced in the two new 
membranes by possibly new objects); 
(0) (Jal Jn [Inv for  € H 
(merging rules for elementary membranes; in the reaction of two 
membranes, they are merged into a single membrane; the objects of 
the former membranes are put together in the new membrane); 


; TRIANGLE 6 e December 2011 


22 A. Alhazov, T.-O. Ishdorj 


(ho) [O], ~[U],[0O-U],, forhe H,UCO 
(separation rules for elementary membranes, with respect to a given 
set of objects; the membrane is separated into two membranes with 
the same labels; the objects from U are placed in the first membrane, 
those from U — O are placed in the other membrane); 

(io) [O lh dh WP 1,0, forh © H 
(release rule; the objects in a membrane are released from a mem- 
brane, surrounding it, while the first membrane disappears). 


The rules are applied non-deterministically, in the maximally parallel man- 
ner; among the rules of types (bo),--- , (ig) at most one can be applied to 
each membrane in each step. 

Rules of types (aq), (bo), (co), (do), and (e9) were introduced in [1], with- 
out membrane polarizations, and without the ability to change the mem- 
brane labels they involve (this is the case in [4] with rules of type (b), (c)). 
Moreover, in [1] rules are considered that can change the membrane labels, 
and they are called type (4), (bg), (co), and (e9). We use this idea and 
this notation also in rules of types (go), (Mo): their primed versions indicate 
the fact that the labels can be changed. Specifically, these rules are of the 
following forms: 


(80) [day l Ving > L Vinge for fi,h2,hs € H. 
(ng) [O],, +[U],,[0-U],,, for hy, lo,hg € H,U CO. 


In what follows we will see how a particular combination of types of rules 
can be used to solve SAT in linear time. 


3 Efficiency 


From [1] we know that P systems with rules of types (a), (bo), (co), and 
(eg) can solve SAT in linear time. In solving SAT, we can eliminate membrane 
division (ej) by using membrane separation (19). 


Theorem 1. P systems with rules of types (ag), (bo), (co), (hp) can solve SAT in 
linear time in a confluent way. 


Proof. Let us consider a propositional formula in the conjunctive normal 
form: 
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B=CQA+::ACn, 
Ci =Yi Veo Vil, 1<i<m, where 
Vix € [oy | Sj snp tei sles eG, 


The instance f of SAT will be encoded in the rules of the P system by multi- 
sets v; and v’ of symbols, corresponding to the clauses satisfied by assigning 
x; to be true and false, respectively: 


oj = {eo | x; E {yz |1 Sk <Uh1<i<m},1<j<n, 
v, = {c; | ax; € fyi | 1 <k <1}. <i<mb1<j<n. 
j po ee J 


We construct the P system 
II = (O,H, ut, Wo, W1, Ws, R), with 
O = {dj,d,|0<i<n+m+5} 
U ligt tind | i) < i < j < n} 
U {cj |1<i< m}U {t, yes,no}, 


H=([ ol lil. 
ws =A, 
wo = do, 
wW1 = do, 


H = {s,0,1,---,m+2}, 


and the following rules (we accompany them with explanations about how 
they are used): 


e Generation phase: 

Gl. di male divatintindir iri Pret < i <n-1, 
G2.| dj > disttipindisr asfisis ly lsisn-1 
G3[O],>[U],[O-U],, 

where U = {d;|1<i<n}Ut{t,,fi;|1<isj <n}, 
G4. fi j = tijasty iy hay 

fig > fijrifijer ly 

ij > tijtttijar ly 

ee oat fijfiju lplstsjan 

In n steps, 2” membranes with label 1 are created, corresponding to all possi- 
ble 2” truth assignments of the variables x1, x2,--- ,Xn. During this process, 


; TRIANGLE 6 e December 2011 


24 A. Alhazov, T.-O. Ishdorj 


objects ti jr 
correspond to the false value of variables x;. These 2” copies of membranes 
1 are placed in the skin membrane (the system always has only two levels 


of membranes). 


th ; correspond to the true value of variables x;, and objects i,j, fi : 


Every object t;,;, Ei is fifi j evolves to tintin fins fi, vespectively. Then 


in’ 
these objects evolve into objects c;, corresponding to clauses Cj, satisfied 


by the true or false values chosen for x;. 


e Checking phase: 
C1.[0]; > [U;],[O- Ui] j41, 
where U; = {cj},1 <i<m. 


Next, starting with i = 1, in membranes with label i, objects c; will be 
separated from the other objects, and the label of the membrane with objects 
O — {c;} will become i+ 1. The membranes which do not contain objects 
cj41 will never evolve anymore. If all objects cj,1 < i < m, are present 
in some membrane, then after m steps this membrane will evolve into a 
membrane with label m + 1, containing objects dy, de by the rules C1. 


C2.[ di, > dn 
C3. [ dn Par 
C4.[ dy — tt | 


- 
If 6 has solutions, then at step n +m +1, every membrane corresponding 
to a solution of B ejects d, in the skin region, and they will all be rewritten 
into tt. 


Chal la ae le hi 
C64] Jig > [4] 
c7/O],—(U'] 
where U! = {t}, 
C8.| d; =} dis4 les 
C9.| d; a di+4 | 


m+’ 


mew =n Pm +e. 


At step n+ m-+3, one copy of ¢ enters the membrane with label 0, and 
(assuming £ has s solutions, 1 < s < 2”) s copies of t enter the s membranes 


; TRIANGLE 6 e December 2011 


Bio-Inspired Membrane Operations in P Systems with Active Membranes 25 


with label m + 1, at step n + m+ 4, s —1 copies of t enter the membranes 
with label m +1, or s —2 copies of t enter the s — 2 membranes with label 
m-+1, and 1 copy of t enters the membrane with label 0. Using rule C7, 
a membrane with label 0 is separated into two membranes, which contain 
object t and object dnim+4, respectively. If 6 has no solution, then no object 
enters the membrane labeled by 0 and rule C7 is not applied. 


e Output phase: 


a [ dntm+5 lo- [ ] no, 
O2.[ d n+m+5 ae > [ J m42¥es, 
03.[ no], — [ ],no, 
O4.[ yes}, + [ eyes. 


If 8 has solutions, then at step n + m +5, object dy4+45 in the membrane 
with label m + 2 ejects yes into the skin and then into the environment. It is 
the (1+ m-+7)th step of the computation. If 6 has no solution, then after 
n+m-+5 steps object dy4m+5 ejects object no into the skin and then into the 
environment. 


The following theorem shows how membrane merging (go) can be used 
instead of rules (bo) to solve SAT. 


Theorem 2. P systems with rules of types (ao), (co), (go), (hg) can solve SAT in 
linear time in a confluent way. 


Proof. We construct the P system 
= (0,H, pt, Wo, W1, Ws, R), with 
ae di |0<i<m+2n} 


U {ti,j, bedi da | 1<i <j < n} 
U {c; |1 <i< m}U {d’,t, yes,no}, 


z= Jol lily 
ws =A, 
Wo = do, 
Ww, = do, 


H = {s,0,1,---,m+2}. 


We reuse rules of the generation phase and rule C1 in Theorem 1, and we 
replace the remaining part of the construction with: 
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e Checking phase (continued): 
C2.| leeall lect > | [Peer 


Using the merging rule as above, in at most m steps, all the membranes 
corresponding to a solution of 6 (assuming f has s solutions, 1 < s < 2”) 
are merged into a single “solution” membrane with label m + 1, which will 
contain s copies of objects d, and di,. 


C3. dm+2n —> dod’ ie 
C4.10], > [U"],[O-U"] 
where U” = {d’}. 


m+ 1’ 


The counter object di+2, from the membrane with label 0 is rewritten into 
dod’, and separated into two membranes with labels 0 and m + 1, containing 
objects d’ and do, respectively. By using rule C4, the latter membrane is 
merged with the “solution” membrane, if 6 has solutions. 


C5. | O Dee 7 [ ul" Lael o=u" 
where U’”’ = {dy,di,}. 


m+2/ 


If membrane m+ 1 contains at least one object d, or d',, then there is a 
solution for 6, and we can separate into two membranes, with label m + 
1 which contains objects d,,,d/,, and one with label m + 2, which contains 
object dp. The object dp evolves into dy. 


C6.| dg > dy lgeed® 


If there is no solution for 6, then the merging rule C5 is not applied. In this 
case, rule C6 will be applied, and object do evolves to dj. 


e Output phase: 


S 


[ 
03.[ no], > [ ],no, 
O4.{ yes |, — [ ] yes. 


If 6 has no solutions, then at step m+ 2n + 2 the object d; from the 
membrane with label m + 1 ejects object no into the skin and then into the 
environment. If 6 has solutions, then after m+ 2n +4 steps object d; in 
the membrane with label m + 2 ejects yes into the skin and then into the 
environment. This is (m+ 2n + 6)th step of the computation. Thus, the sat- 
isfiability problem is solved. 


; TRIANGLE 6 e December 2011 


Bio-Inspired Membrane Operations in P Systems with Active Membranes 27 


Also rules for the release of vesicle contents (ig) can be used instead of 
rules (cg) in the following way. 


Theorem 3. P systems with rules of types (ao), (go), (hg), (io) can solve SAT in 
linear time in a confluent way. 


Proof. Following the generation phase of Theorem 1, and the checking phase 
of Theorem 2, we replace the output phase of the construction by: 


e Output phase: 
O1.[ d; — d’ no | 
O2.[ d; — d! yes | 
03/0 | vag —> [UW | 
where LU” = {d’}, 
OF) Ol y.c3 1" asl OO |e 
where U” = {d’}, 

05.[[0].], > [ ].0. 


Ss 
If 6 has no solution, then the counter object d; in the membrane with label 
m +1 is rewritten into d'no and separated into two membranes, one with 
label m +1, which contains object d’, and one with label s, which contains 
object no. At the (m+ 2n+4)th step, rule O5 is applied, thus releasing 
object no into the environment. If 6 has solutions, then the counter object 
d, in membrane with label m + 2 is rewritten into d/yes, and then separated 
into two membranes. The membrane with label s will contain object yes. 
After m +2n + 6 steps, the object yes is released into the environment by 
applying rule O5. 


a POS |e 


ale 


4 Conclusions 


We have considered several new types of rules for membrane handling: (g0) 
membrane merging, (49) membrane separation, and (ig) membrane release, 
common in cell biology. 

These types of rules could also be used in neural-like networks of mem- 
branes because naturally crowded chemicals in a neuron are transmitted 
through an axon, and released in to the cleft of the synaptic connections of 
neurons package by package in vesicle formation and uptaken by neurons 
from the cleft. 

The following problems require future work: What is the power of P 
systems that use particular combinations of rules of types (go), (io), and 
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(ig) with other rules, and primed versions of these rules? For instance, can P 
systems with rules (49, bo, co, do, €0, 9, No, i9) solve SAT in linear time? What 
are the versions of the rules of types (go), (ho), and (ig) for non-elementary 
membranes? 
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Summary. Starting from the model proposed by means of Grammatical Evolution, 
we extend the applicability of the parallel and cooperative searching processes of 
Evolutionary Algorithms to a new topic: Tree Adjoining Grammar parsing. We 
evolved derived trees using a string-tree-representation. We also used a linear match- 
ing function to compare the yield of a derived tree with a given input. The running 
tests presented several encouraging results. A post running analysis allowed us to 
propose several research directions for extending the currently known computa- 
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1 Introduction 


Evolutionary Algorithms (EAs), mainly probabilistic searching techniques, 
represent several converging research areas that have their roots in the 
1960s. They were introduced by Hans-Paul Schwefel, Holland and De Jong. 
Despite the fact that various sub-domains of EAs, Evolutionary Strategies 
(ESs), Genetic Algorithms (GAs), Evolutionary Programming (EP), Genetic 
Programming (GP), etc. appeared as separate research domains they all have 
a basic common structure and common components. A searching space and 
a coding scheme representing solutions for a given problem, a fitness func- 
tion, and operators to produce offspring and select a new generation are the 
main components of EAs. Common terms such as individuals that group 
together the coding scheme and the fitness function, population of individ- 
uals, and sub-populations are used in all EA sub-domains. EAs try to solve 
searching problems by mimicking natural principles of selection and sur- 
vival of the fittest individual from a population. Real world applications 
of EAs deal with maximizing or minimizing objective functions such as 
resource location or allocation optimization. EAs are usually used for large 
searching space problems for which they find efficient solutions to problems 
that in general require a large amount of computation time. Distributed Evo- 
lutionary Algorithms (DEAs), parallel and cooperative searching processes 
have been proposed as an extension of centralized EAs in order to avoid 
premature convergence and to solve problems faster. 

EAs have also been used for automatic program generation. GP, in par- 
ticular, was used to generate target Lisp code. Grammatical Evolution is a 
new approach proposed by O’Neill in [13] that uses Context Free Grammars 
to automatically evolve computer programs in arbitrary languages. 

Starting from the model proposed by Grammatical Evolution, we ex- 
tended the applicability of the parallel and cooperative searching processes 
of Evolutionary Algorithms to a new topic: Tree Adjoining Grammar pars- 
ing. We evolved derived trees using a string-tree-representation. Implement- 
ing a linear matching function to compare the yield of a derived tree with 
a given input we obtained several encouraging results during the running 
tests. 

Due to the high complexity of some classical parsing algorithms, long 
sentences analysis could be a very difficult task for a computer program. 
Evolutionary Algorithms used for parsing are able to process long sentences 
due to their reduced computational complexity. In one of our examples, we 
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implemented a linear complexity fitness function and in conjunction with 
the global complexity of the whole EA, in comparison with the O(n°) which 
is the complexity of the classical parsing algorithm for the same formalism, 
we increased the limit of the parsed words per sentence. 

At the end of the paper we present a post running analysis that al- 
lowed us to propose several research directions for extening the currently 
known computational mechanisms in the mildly context sensitive class of 
languages. 


2 Basic aspects of evolutionary algorithms 


Various sub-domains of EAs developed a more or less rigorous theory to ex- 
plain why the algorithms perform so well when they solve searching prob- 
lems. For genetic algorithms there are several hypotheses that try to explain 
the partial results obtained. According to Goldberg [7] who introduced the 
schema theorem - the lower the number of symbols in the alphabet used for 
the coding scheme, the higher the implicit parallelism was, so an adequate 
coding scheme for GAs should use a binary alphabet. Studies [12] with 
high cardinality alphabets revealed that using or real codification had unex- 
pected advantages(in particular, they made it possible to introduce new and 
stronger genetic operators such as the average crossover which performs 
better). In [8], Goldberg developed a new theory about how representations 
with a high number of symbols could perform better. Very simple fitness 
models are used for Evolutionary Strategies in order to obtain analytical 
results. 

Generally, there are some unknown aspects regarding the most suitable 
EA operators for a given searching problem. There is a large set of recom- 


WM 


bination operators such as “one point crossover”, “two points crossover”, 
“shuffle crossover”, “average crossover”, “uniform mutation”, “normal mu- 
tation”, “step by step mutation”, etc. Also the select next generation oper- 
ator has many variations such as: “roulette wheel selection”, “elitist selec- 
tion”, “disruptive selection”, “rank space selection”, etc. We can find more 
details about the mentioned operators in [2], [3] and [18]. How can we 
know which is the best operator set capable of solving a given problem 
faster? How can we compare the results of different Evolutionary Algo- 
rithms when the initial population is randomly generated and the opera- 


tors act randomly? Using the new paradigms of distributed Evolutionary 
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Algorithms, new problems arise. Do distributed EAs perform better than 
centralized EAs? How can we compare the performances of distributed or 
centralized EAs? The answers to these questions depend on the problem 
that EAs try to solve and up to now, due to the lack of a rigorous theory 
explaining the basic aspects, empirical results are the only way of shedding 
some light on this area. 


2.1 Biological inspiration of evolutionary algorithms: common terms 


Evolutionary Algorithms were inspired by biological models so the termi- 
nology used for different data structures or procedures uses the biological 
terms. It is somehow surprising that a list of common terms used in Evolu- 
tionary Algorithms and Genetics are explained using a single definition and 
not as terms with different “algorithmic meaning” or “biological meaning”. 
The explanation relies on the fact that the common list of terms uses sev- 
eral basic terms such as “generation”, “individual”, “population”, etc. the 
meaning of which is clear from the biological or computational points of 
view. 

The following list presents only the most important terms used both in 
biology and Evolutionary Algorithms. 

A gene is a physical and functional unit of heredity that carries informa- 
tion from one generation to the next. 

The chromosome is the structure that carries the genes. 

Locus means the location of a gene on a chromosome. 

An allele represents one of the different forms of a gene that can exist at 
a single locus. 

The genome is the sum of all the genetic material in a chromosome set. 

The genotype means the specific allelic composition of a certain gene or a 
set of genes. 

The phenotype is the visible or measurable characteristics of a genotype. 

Mutation is a change of a gene. 

Crossover means the exchange of genetic material between maternal and 
paternal chromosomes. 

Fitness in population represents the ability of a particular genotype to 
reproduce itself compared to all other genotypes. 

Epistasis appears when a gene expression is controlled by another gene. 
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2.2 Theoretical approach on evolutionary algorithms 


A very good overview of Evolutionary Algorithms theory can be found 


in 


[1]. Briefly, an Evolutionary Algorithm may be defined as a 7-tuple 


EA = (1,®,11,A,Q,5,t) (1) 


where: 


I represents the set of the searching space instances usually called in- 
dividuals. Sometimes associated with individuals we can keep useful 
information for genetic operators. 

® is a fitness function associated with individuals. 

ve denotes the number of parents. 

A is the number of children in the population. 

© is a set of genetic operators which produce new A children when 
applied to the parents. 

s is the selection operator that changes the number of individuals from 
parents and children to produce the next generation of parents (I*+* > 
I). The selection operator may also consider that after one generation 
the parents have completed their task and thus s selects only from the 
population of children (I* + I"). 

t represents the stop criterion which may be “Stop when a good enough 
value is reached by an individual fitness function”, “Stop after a certain 
number of generations”, “Stop if the population converged to a single 
individual”, “Stop after a maximum time available for computations”, 
etc. 


A general Evolutionary Algorithm can be described as follows, where 


gen represents the generation number and s, ¢ are, respectively, the selection 
operator and the termination criterion. 


The structure of an Evolutionary Algorithm is: 


gen:=0; 
initialize with random values ae evaluat P(0):= 
{( i 1(0),®( i 1(0))),.--/( t (0), PC 7 p(0))) I: 
repeat 
apply genetic operators and evaluate (P(gen))— P’(gen)= 


{( 7 4(gen), (74 (gen))), 1 Fh (gen), ®(7 4 (gen))) } 
select the next generation P(gen+1):=s(P(gen),P’(gen)); 
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gen := gen+l; 
until (#(P(gen))); 


2.3 Schema theorem for genetic algorithms 


The schema theorem presented in [7] assumes that a genetic algorithm 
works with a binary coding. First, we add a new simbol * to the binary 
alphabet {0, 1} which means “matches with both 0 and 1 symbols”. Strings 
formed with symbols from the extended alphabet {0, 1, *} are called 
schemata. A schema describes a set of binary string. As an example, the 
schema *10* represents the set {0100, 0101, 1100, 1101}. For a binary string 
of length / there are 3! schemata and in general for strings over alphabets of 
cardinality k there are (k +1)! schemata. 

Let us consider a schema H taken from the alphabet {0, 1, *} whose 
length is |. We consider that individuals inside the algorithm have the same 
number of | bits. An individual in the population represents 2’ schemata 
while on every position we may have the actual value or *. So ina population 
of n individuals, there may be n - 2! schemata. We can now define a schema 
order, denoted by o(H) that is the number of fixed positions (characters 
other than the * symbol) in the schema. The length of a schema denoted by 
6(H) is the distance between the first and the last character other than the *. 

Assume that we have A(t), a population A at generation t, and we use 
the notation m(H,t) if there are m particular schema H inside the popula- 
tion. During the selection of a new generation a string is copied with the 
probability p; = oe, where fj is the fitness of the particular string and ) fj 


is the sum of the fitness function inside the population. We write f(H) the 
average fitness of the strings representing schema H and with f the average 
fitness of a population, that is f = Ea We can deduce that after the selec- 


tion of a new generation of n individuals the multiplicity of the schema H 
is: m(H,t+1) =n- p;-m(H,t). That is 


A 
m(H,t+1)= Presi ces 
f 
The formula above shows that the number of schemata that are below 
the average fitness decreases in the subsequent generation, and the number 


of schemata that are above average fitness increases during the next gener- 
ation. 


; TRIANGLE 6 e December 2011 


Applications of Evolutionary Algorithms in Formal Languages 35 


During the crossover operation, schema H is destroyed with the proba- 
bility pg = ted) so schema H will survive with the probability p; = 1— pq. 
If we consider that for a given individual there is a crossover probability p, 


then the previous formula becomes: 


lo 


(H) 
Ps = 1— Pec° T-1 

We used “>” instead of “=” because the schema H might appear in the 
population after a crossover of different individuals that do not represent 
the schema H. 

For the mutation operator the probability of surviving for a schema H 
depends on every fixed surviving position. For one position the probability 
of surviving is (1 — pm). Therefore, for the whole schema, which has 0(H) 
fixed positions, we have a surviving probability (1 — parr, which for 
1 >> pm may be approximated by the expression (1 — pmo(H)). 

Now, considering all the effects of selection, crossover and mutation, and 
by multiplying the respective expressions, we obtain: 


6(H) 
f=1 


H 
mH, t-+1) > m(H DAE ) -|1—pe- 0(H)- pm 
The interpretation of the above formula, known as the schema theorem, 
is: short, low order, above average schemata receive exponential surviving 
chances in the subsequent generations. 


2.4 Distributed evolutionary algorithms 
We consider the distributed EAs as 9-tuples 
DEA = (1,®,p,A,0,s,t,n, 5) (2) 


where: 


e I,®,u,A,Oand s are the same as for EAs. 

e we consider t = (t¢(impSol), te(maxGen)) where t-(impSol) means “ter- 
minate when a solution is found that is equal to or better than imp- 
Sol”, and te(maxGen) means “terminate when exceeded maxGen gener- 
ations”. 

e nrepresents the number of distributed populations that evolve in paral- 
lel during the distributed EA. 
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e Gis an operator that exchanges individuals among populations. 


In order to describe the structure of DEAs, we use the notation ||procP for 
“parallel execute procP”. 


The structure of our implemented 
Distributed Evolutionary Algorithm is: 
||initialize with random values and evaluate P,(0),...,P,(0); 
||gen, :=0,..., gen := 0; 
||repeat for every population 
apply genetic operators and evaluate 
(Pi(gen)) — Pj (geny),..-,(Pu(genn)) + Pa(genn); 
apply exchange of individuals between populations 
(Pr, Pm) for some k, min 1..n; 
select the next generation P; (gen, + 1) := s(Pi(gen1), P}(gem1)),..., 
Pn(gertn +1) := s(Pu(genn), Py (genn)); 
gen, := gen; +1,...,geNy := Benn +1; 
until (4k s.t. (t¢(impSol) for P.(genx)) OR 
(te(maxGen) for P;(gen;) AND...te(maxGen) for Pu(genn)); 


— 


Studying the distributed evolutionary systems we observe the existence of 
the following hierarchy of objects: individual, subpopulation, population, 
distributed population, etc. 

Naturally several questions arise: 


e Is this hierarchy complete or can we define more and more complex 
levels? 

e What part of the above hierarchy might be computed in a centralized 
way and what part in a distributed way using multithreading, multipro- 
cessors, computer networks? 


The answer to the first question is that we can imagine defining more 
and more complex levels and we can use dedicated terms such as clusters 
of distributed populations, super clusters, etc. In order to simplify the terms 
used in distributed EAs, we propose the following terminology (Table 1): 

We can imagine different distributed EAs computing architectures. We 
mention that after a centralized layer there is no point for superior lay- 
ers processed in a distributed way (the gain in speed coming from the 
distributed processing would be canceled by the synchronization with the 
lower centralized levels). In Figure 1 we present the possible interactions of a 
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Level 0 |Individual 

Level 1 /Subpopulation 

Level 2/Population 

Level 3 /Distributed Population 
Level 4 


Level n 
Table 1. Hierarchy of distributed EAs objects 


centralized EA layer with other layers. We also note the the highest process- 
ing layer being responsible for the algorithm itself needs to be centralized. 


exchanges 
with other 
distributed 
EA levels 


Individual 


subpopulation | ——— 


Fig. 1. Centralized EA that have connections with other distributed EA levels 


Using just levels instead of hierarchical terms for the population layers 
we can imagine the following classes of distributed EAs (Table 2). (Table 2): 


Based on the previous considerations we introduce a new notation for 


distributed EA classes. We denote by (n, m), the class of a distributed EA, 
where n is the total number of the EA layers, and m is the number of the 
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Level 0 Level 1 Level 2 Level 3 Level 4 S38 Level n 
Distributed | Distributed | Distributed | Distributed | Distributed | Distributed | Distributed 
Centralized | Distributed | Distributed | Distributed | Distributed | Distributed | Distributed 
Centralized | Distributed | Distributed | Distributed | Distributed | Distributed 
Centralized | Distributed | Distributed | Distributed | Distributed 
Centralized | Distributed | Distributed | Distributed 


Centralized | Distributed 


Table 2. Possible distributed EAs computing architectures 


distributed levels. As an example, the distributed GA implementation where 
individuals are decentralized objects belongs to class (1, 0). 

Usual distributed EAs implementations deal with (n, 3) classes. Figure 1 
is also an example of such distributed EA architecture. 


2.5 Evolutionary algorithms used in problem optimization, first example 


When we try to solve a problem using EAs, we should first design the search 
space. Depending on the number of dimensions and the searching precision 
we automatically get the genetic structure of individuals. In almost all cases, 
individuals in an EA represent solutions for the given problem and the 
fitness function evaluates how good the solution is. Finally, choosing the set 
of control parameters is a matter of personal experience and intuition. 

The game of TicTacToe is played between two partners on a board as 
we can see in Figure 2. Every partner marks the squares of the board with 
distinct symbols. The goal of the game is to obtain a line (horizontal, ver- 
tical or diagonal) of a certain length marked with the same symbol. In our 
implementation one of the partners is an Evolutionary Algorithm that auto- 
matically proposes the next move. 

In the case of the TicTacToe game, what would be a solution for a given 
situation on the board? We consider that individuals are formed from 2 
genes, a starting coordinate and a direction to follow. How would one indi- 
vidual be evaluated with respect to the proposed coding? Suppose that EA 
plays with X. The value of an individual is the sum of already marked X 
values. We assign negative values for 0 symbols in the individual’s “body”. 
In order to increase the “intelligence” of the proposed solution we consid- 
ered not the best individual as the next move, but the intersection of several 
best individuals proposed by the algorithm. 

More details regarding the implementation are not within the scope 
of the current article. The implementation of the EA that plays TicTacToe 
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| Direccién a C:\Horia\JavalJdev\TicTacToe|Server\public_html\TicTacToe,html 
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Info: Cancel by user request 


(B) Subprograma gameServer,playTheGame started 


Fig. 2. An implementation of a TicTacToe game partner using an EA engine 


may be tested on the web address http://grammars.grilmc.com/GRLMC/ 
PersonalPages/AdrianHoria/tictactoe.html. 


2.6 Grammatical evolution 


Grammatical Evolution is a new approach proposed by O’Neill in [13] 
that uses Context Free Grammars and Genetic Algorithms to automatically 
evolve computer programs in arbitrary languages. Despite the fact that the 
goal sounds very ambitious, the implementation has several limitations. 

In paper [13] the EAs find a function of one independent variable and 
one dependent variable in symbolic form, which fits a given sample of 20 
data points (x;, y;). The quadratic polynomial function x* + x3 + x* +x with 
points from the interval [—1, 1] was used. 

The grammar used was the following: 
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1) expr > t= < expr >< op >< expr > (0 
| (< expr >< op >< expr >) (1 


( 
( 
| < pre—op > (< expr >) 


| < var > 3 
2) < op > = | = |/|* 
3) < pre —op > ::= sin|cos|tan|log 
4) < var > n= X, 


The algorithm constructs a symbolic expression using the sentential 
form. The genetic coding is a string of bytes. First the algorithm starts with the 
starting symbol (< expr >) and it expands the leftmost symbol considering 
the gene value mod the number of choices. Then the next leftmost symbol 
and the next gene are used. If there is only one choice then the symbol is ex- 
panded without considering the gene value. This procedure continues until 
all the nonterminals in the sentential form were expanded. If the string of 
genes is exhausted before the nonterminals in the sentential form, then the 
string of genes is used once again from the beginning as if it were a circular 
string. 

The fitness function evaluation promotes a multicriterial optimization, 
which maximizes the number of fitting points and minimizes the error 


20 
d If) =i. 


This approach is also useful in parsing . We can observe that GE cannot 
really evolve programs, only functions specified by samples. 


3 Evolutionary algorithms for tree adjoining grammatical 
evolution 


Context-free grammars (CFGs) are a well known class of grammars that 
are extensively used for programming languages and they can also de- 
scribe almost all structures of natural languages. Yet in shuch cases as 
multiple agreement languages {aa} ...a?|n > 1,k > 3}, copy languages 
{ww|w € {a,b}*} and cross agreement {a"b"c"d™|n,m > 1}, context-free 
grammars are not the most appropriate investigation instrument for natu- 
ral language analysis. Tree-Adjoining Grammars, or TAGs for short, were 
introduced by Joshi, Levy and Takahashi in 1975 to model some linguistic 
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aspects. Tree-Adjoining Grammars (TAGs) are an important class of gram- 
mars, originally motivated by linguistic considerations, which subsequently 
yielded important mathematical and computational results, which in turn 
had linguistic implications. A.K. Joshi and Y. Shabes published an overview 
of TAGs in [11]. 

Parsing algorithms play an important role in the implementation of com- 
pilers, interpreters for programming languages and natural language pro- 
cessing. Parsing usually refers to the construction of a derivation tree. It 
is also possible to decide if a string belongs to a given language or not 
(the membership problem) without constructing the derivation tree. Numer- 
ous parsing algorithms have been developed over the years. Two of the best 
known parsing algorithms for CFGs are the CYK recognizer [5] and the 
Earley parser [4]. 

A recognizer is an algorithm that takes a string as input and either ac- 
cepts it or rejects it, depending on whether it belongs to the language of a 
grammar or not. In Figure 3 we can see a recognizer data flow. 


Grammar, Input String Recognition Accepted/Rejected 
Algorithm ———— 


Fig. 3. A recognizer data flow 


A parser is a recognizer which also outputs the derivation trees if the 
string is accepted by the grammar. 


3.1 Definition, components and composition operations in tree adjoining 
grammars 


Definition 1. A tree-adjoining grammar is a 5-tuple (T,N, 1, A,S) where: 


1. T is the alphabet used to build up a language (finite set of terminal symbols). 
2. N is the set of non-terminal symbols (variables). 
3. S is the start non-terminal symbol of the grammar. 
4. I is a finite set of finite trees, which are called initial trees, and they have the 
following features: 
e the interior nodes are non-terminal symbols. 
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e the nodes on the frontier of the initial trees are terminal or non-terminal 
symbols; the non-terminal symbols on the frontier which can be substituted 
are marked with a down arrow (|). 

5. A is a finite set of finite trees, which are called auxiliary trees, and they have 
the following features: 

e interior nodes are non-terminal symbols. 

e the nodes on the frontier are terminal or non-terminal symbols. 

e The nodes on the frontier are marked for substitution (|) except for the 
foot node (annotated with an asterisk *). The label of the foot node must be 
identical with the label of the root node. 


A TAG where at least one terminal symbol (anchor) appears at the fron- 
tier of every initial or auxiliary tree is called a lexicalized TAG. 

The trees in I A are called elementary trees. 

The trees with roots labeled by the nonterminal A are called A-type trees. 

We can associate a node address (Gorn-position) with every node in a 
tree in an inductive way. The root node has the empty address. For a child 
node we take the parent address and we add a dot and then the number of 
children counted from the left. In the set of elementary trees we can form a 
global address using tuples formed by (treeName, node Address). 

There are also alternative ways to assign addresses to nodes of trees. 
For instance, we can assign sequential numbers to nodes by traversing re- 
cursively the tree in a root-left-right or left-root-right manner. We can even 
count the nodes in all the trees sequentially. 

TAGs operate with two composition operations, adjoining and substitu- 
tion. 


Definition 2. Substitution is an operation that takes a nonterminal node A marked 
for substitution by a down arrow (J.) which is located on the frontier of a tree and 
replaces it with a tree from the initial trees whose root has the same label as the node 
A. 


Definition 3. Adjoining builds a new tree from an auxiliary tree B and another 
tree «, which can be initial, auxiliary or a derived tree. Let us consider that the 
root node of the auxiliary tree B is labeled by X (also the foot node, by definition). 
If an internal node of the tree a is labeled by X then the adjoining operation will 
construct a tree as follows: 


e the sub-tree of « dominated by X, call it t, is removed from the tree x, leaving a 
copy of X behind. 
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e the auxiliary tree B is attached to the copy of the node labeled by X in the excised 
tree a. 

e the sub-tree t is attached to the foot node of B and the root of t is identified with 
the foot node of B (they have the same label X). 


By definition, any adjoining on a node marked for substitution is forbidden. 


For linguistic reasons, we need a more precise way to specify which 
auxiliary tree can be adjoined at a given node. Therefore several constraints 
on adjoining were introduced. 


Definition 4. In a TAG G = (T,N,I,A,S), for each node of an elementary tree 
on which adjoining operation is allowed we can specify one of the following three 
constraints on adjunction: 


e Selective Adjunction written as SA(AT) specifies a set of trees AT C A, the 
set of auxiliary trees that can be adjoined in a given node. 

e Null Adjunction written as NA forbids any adjunction on a given node. We 
have NA=SA(@). 

e Obligatory adjunction written OA(AT) specifies a set of trees AT C A, from 
which one of the trees is mandatory to be inserted on a given node. 


If there are no substitution nodes and no constraints, then we have a pure 
TAG. 

After applying the adjunction or the substitution operations we obtain 
the derived trees. They do not have the information about how they were 
built, so we need to build a special structure that can specify how a derived 
tree was constructed. 


Definition 5. A TAG derivation tree is a tree used to show how a derived tree was 
constructed and it has: 


e aroot labeled by an S-type initial tree. 

e all other nodes are labeled with trees and parents’ nodes addresses where the 
composition operation (substitution / adjoining) has been performed. 

e the arcs in the derivation tree connect a node labeled with (tree, parent’s node 
address) with the parent tree. The substitution arcs are dashed and the adjoining 
arcs are continuous. 


An initial tree is completed if there is no substitution node on its frontier 
and if all the obligatory adjunction constraints are satisfied. 

The tree set, Tg of a TAG is defined as the set of the completed initial 
trees derived from some S-rooted initial tree. 
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The string language, L(G), of a TAG is the set of yields of all the trees in 
the tree set Tg:L(G) = {w|w = yield(t), t € Tg} 


3.2 Lexicalized grammars 


Lexicalized grammars, presented in [11] by Joshi and Shabes, have both 
linguistic and formal importance. 


Definition 6. A grammar is lexicalized if it consists of: 


e a finite set of structures, each one associated with a lexical item called the anchor 
of the corresponding structure. 
e one or more operations for composing the structures. 


The anchor must not be the empty string. 
Proposition 1. Lexicalized grammars are finitely ambiguous. 


We observe that a finite sentence has a finite number of lexical items and 
hence a finite number of structures attached to the lexical items. The finite 
number of structures may be combined in finitely many ways to produce 
compound structures. Therefore we have a finite number of derivations that 
produce the initial sentence. For this reason we have: 


Proposition 2. It is decidable whether or not a string is accepted by a lexicalized 
grammar. 


Definition 7. We say that a formalism F can be lexicalized by another formalism 
F’, if for any finitely ambiguous grammar G in F there is a grammar G' in F' s.t. 
G and G' generate the same tree set and G’ is lexicalized. 


In general CFGs are not in lexicalized form because not all the rules contain 
a lexical item on the right hand side and sometimes CFGs can be infinitely 
ambiguous, containing recursive derivation chains such as X = >* X. Lex- 
icalization of finitely ambiguous CFGs achieved by transforming them into 
Greibach Normal Form [5], can be regarded as _ weak lexicalization since we 
do not preserve also the structure of the derivation trees. The above defini- 
tion of lexicalization may be regarded as strong lexicalization. 


Definition 8. A tree-substitution grammar (TSG) is a TAG without auxiliary trees 
and without the adjoin operation. 
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We use the same terminology for derived trees and derivation trees. A tree 
is completed if it has only terminals on its frontier. We denote with ty an 
X-type initial tree and with Fr(tx) the frontier of the tree t. 

The set of languages generated by TSGs is the same as the set of lan- 
guages generated by CFGs. It is easy to see that for a TSG there is an equiv- 
alent CFG that generates the same language. For any initial tree ty in the 
TSG we write a CFG production of the form X — Fr(tx). For the other 
inclusion we take a CFG grammar and for any production, we construct an 
initial tree with the root the nonterminal in the left hand side of the pro- 
duction, having the children the terminals and non-terminals from the right 
hand side of the production. The non-terminals on the frontier of the initial 
tree are marked for substitution. 

The following propositions are well known from literature [11]. 


Proposition 3. Finitely ambiguous CFGs cannot be lexicalized by a TSG. 


Proposition 4. A finitely ambiguous CFGs which does not generate the empty 
string may be lexicalized by a TAG without substitution nodes. 


We observe that adjunction is sufficient to lexicalize CFGs, but using 
substitution as an additional operation we may obtain more compact TAGs. 


3.3 A classical tree adjoining grammar recognizer 


We present a version of an Earley algorithm for TAGs described by Joshi 
and Schabes in [11]. The original algorithm uses a chart of items. Every 
item contains a dotted tree and the dot may be in one of the following posi- 
tions with respect to a node in the tree: left-and-above, left-and-below, right- 
and-below, right-and-above. Because of this representation, there might be 
several equivalent items in the chart with different representation as we 
can see in Figure 4, for every dotted tree there can be two equivalent items 
in the chart whit right-and-above and right-and-below equivalent dot posi- 
tions on the left picture and right-and-above and left-and-above equivalent 
dot positions on the right picture. 

We introduce a shorthand notation for tree structures namely, string rep- 
resentation for trees, in order to simplify tree descriptions. For the defini- 
tions and notations related to N, T-trees we refer the reader to [6]. An N, T- 
tree can be represented as a string by the mapping 
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S aS 
: | wl, 
. a 


b GNAx Cc 


Fig. 4. Equivalent dot positions 


str : Tn(T) > (Ng UT)*, defined as 
str(ta()) =a, a€ TU {A} (a single-node tree labeled a), 
str(ta(t1,---,tm)) = Astr(t,)---str(tm) A, AEN 
(a tree labeled A with subtrees t1,--- ,t). 


where Ng = NUN, N= {A|A€EN}. 

If we can rapidly distinguish the nonterminals from the terminals by us- 
ing, respectively, uppercase letters and lowercase letters from the alphabet, 
then we can simplify the notation even more, and use only closed. brackets 
instead of bar symbols : ie. str(t4(t1,:-: ,tm)) = Astr(t))---str(tm) |, A € 
N. 

Using the string-representation for trees we reduced the state description 
for an item because instead of a dotted tree and a position of the dot we have 
only a dotted string-tree. Additionally, the problem of equivalent dotted 
items disappears, as we can see the trees in the Figure 4 have the following 
representation: Se.] and SN4a.SbSN4 «|c}d]. 


Informal description of a TAG (LTAG) recognizer 


In an attempt to reduce the number of trees in grammars, practical consid- 
erations imposed the usage of lexicalized TAGs (LTAGs) instead of TAGs. 
For the algorithms described in this paper the Tree Adjoining Grammars 
should not be lexicalized but the implementations could benefit from the 
fact that the grammars are lexicalized. 

We define an item s as a 7-tuple, 


s = |treeName, dottedStringTree, i, j,k, 1, sat?] 


where: 
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treeName is the name of an elementary tree. 

e dottedStringTree is the string-representation of the tree treeName. 

i,j, k,l are indices of positions in the input string ranging between -1 and 
n, n being the length of the input string. Only the indices j and k may 
have the -1 value and this means that they are not bounded. The index | 
is used to point the last analyzed character in the input string. 

e sat? is used to disallow more than one adjunction in the same node. 
sat? takes values ranging from {nil, true}, where nil means no adjunc- 
tion has yet been performed on the dotted node and true means that an 
adjunction has been performed on the dotted node. 


According to the recognizer algorithm’s necessities, the dot may be po- 
sitioned in front of the foot symbol (*) or after. 

Initially, for all S-type initial trees «;, chart C contains all items of the 
form: 


[a;,".” +a;StringTree,0, —1, —1,0,nil] 


Depending on the items in chart C, new items are added to the chart. 
According to the items in the chart four basic operations add new items: 
PREDICT, SCAN, COMPLETE and ADJOIN to which work basically as de- 
scribed in [11]. The algorithm stops in two cases: 


e if no items can be added to the chart then the input string was not 
recognized. 
e if an item of the form: 


[x, «StringTree+”.”,0, —1, —1,n, nil], 


where « is an S-type initial tree is to be added then the input string was 
recognized. It is normal for indices j and k to have the value -1 since in 
the item we refer to an initial tree without a footer node 


Formal description of a TAG (LTAG) recognizer 


We use the index in the string-tree representation as the nodes’ adresses and 
the function pos to return the node number in the string-tree representation. 

We use the following notations: Var is the set of variables; Term is 
the set of terminals; StartingSymbol is the value of the starting symbol; 
trees is the set of all elementary trees; for a given tree t we write str(t) 
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Functional |After|Current Supplementary 
description |dot |item conditions 
Scan 1 a (1, s1+“.a"+s2, JinputString[1+1]=a 
i, j, k, 1, nil) 
Added item (t1, s1+“a.”+82, i, j, k, 1+1, nil) 
Scan 2 A (f1, s1+”.”+A+s2, 
i, j,k, 1, nil) 
Added item (f1, S1+A+”.” +82, i, j, k, 1, nil) 
Predict 1 V (f1, s1+”.”+V+s2, |for each at € 
i,j, k, 1, nil), Adj(t1, pos(V)) 
Added items (at, “.”+str(at), I, -1, -1, 1, nil) 
Predict 2 V \(#1, $1+.”+V+s2, |OA(V) = false 
i, j, k, 1, nil), 
Added item t1, s1+V+".”+8s2, 1, -1, -1, 1, nil 
Predict 3 = (f1, s1+V+".*"+s2,|for each t in trees, 
1, -1, -1, 1, nil) {for each V in str(t), 
{ str(t)=s3+V+s4 
t1 € Adj(t, pos(V)) 
i 
Added items (t, s3+V+".”+s4, 1, -1, -1, 1, nil) 
Complete 1 |] (f1, s1+”.]’+s2, |for each if in C 
i,j, k, 1, nil) it=(t2, s3+V+".*]"+s4, 
i,-1, -1, i, nil, 
(814%. ]”)=V 
Added items (#2, s3+V+“*.]"+s4, i, i, 1, 1, nil) 
Complete 2 |] (f1, s1+”.]”+s2, |for each it in C 


i,j, k, 1, nil) 


it=(f1, s3+“.”+V+s4, 
h, j’, k’, i, sat? ) 
(s1+".])=V 


Added items 


(#1, s1+“].”+s2, h, max(j, j’), max(k, k’), 1, sat?) 


Adjoin 


(1, 
V+slt*"4+s247.", 
i,j,k, 1, nil) 


for each it in C 
it=(t2, 

s3+”.]” +84, 

ip, qk, nil) 

+ (s3+”.]”)=V, 

t1 € Adj(t2, pos(V)) 


Added items 


(t2, s3+”].”+84, i, p, q, 1, true) 


Table 3. New items added to the chart 
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the string-tree representation of the tree t!; NullAdjoining is the set of 
pairs (treeName, nodeAddress) for which the Null Adjoining attribute is set; 
Adj(treeName, node Address) is the set of auxiliary trees that can be adjoined 
in the treeName at the address nodeNumber; inputString is the value of the 
string to be recognized and we have the inputString = a,...an; root isa 
function that has one argument as a string-tree t and returns the nontermi- 
nal character that is the root of the tree t; max(i,/) is the maximum value 
between i and j; openBracket is a function that returns the non-terminal cor- 
responding to a certain closed bracket in the string-tree representation that 
may be described as follows: 
counter =1 
Starting from the dot position, 
go right in the dotted string-tree, 
increment the counter for any ’’]’’, 
decrement the counter for any V in Var, 
if counter=0 then return V 
Internally the algorithm builds the initialTrees and auxiliaryTrees sets 
based on the trees set. We also use the item definition as described in the 
informal description. The recognition algorithm works with a set of items 
collected in a chart C. Also we define a procedure chartAdd that adds an 
item to the chart C only if the item is not already in the chart. 
The pseudo-code for the TAG recognizer algorithm may be described as 
follows: 
function recognize 
C=9 
for each tree in initialTrees 
if root (tree)=StartingSymbol then 
chart Add (|tree,?.’+-stringDescription (tree), 0, —1, —1,0, nil]) 
end if 
next tree 


apply for each item in C 
if (addNewltems(item)=’’stop’?) then 
return (’’recognized’’) 
end if 
until no more items are added in the chart C 


1 We can distinguish between initial trees and auxiliary trees, because auxiliary 
trees will contain the footer symbol “*” 
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return(?’*not recognized’’) 
end function 
The function addNewItems(currentItem) returns “stop” if a new item of 
the form: 


[a, xStringTree+”.”,0,—1, —1,n, nil], 


where « is an S-type initial tree is to be added to chart C. 

The function addNewItems considers the current item in the chart, and 
depending on the character that is after the dot in the dotted string tree 
description and a supplementary condition, adds new items to the chart. 

To reduce the space used by variable description we use NA and OA 
for Null Adjoining, Obligatory Adjoining sets, respectively. We will also use 
a for a terminal a € Term, V for a variable V € Var, and + for the function 
openBracket. 


A step by step example for the classical TAG (LTAG) recognizer 


We illustrate the recognizer’s work using an example. To describe the cor- 
responding TAGs, we use the formalism that we have already introduced. 


Example 1. 


Table 4 describes the TAG G2 = (N = {S}, T = {a,b,c,d,e}, 1 = {a: 
Se), A= {br Sachs «\eld)}, 5). 


Variable name Type | Value 

Var set |{S} 

Term set |{a,b,c,d,e} 
StartingSymbol value|S 

trees ny Se] 

trees p SaSbS*]c]d] 
NullAdjoining pb 15 
ObligatoryAdjoining|set — |{} 
inputString value|aabbeccdd 


Table 4. Recognizer description of the TAG G2 


The recognizer fills in the chart as described in Table 5. 
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Input Item Apply 
read No|from Value 
The positions in the input string are: 9@1a2b3b4e5cec7dgdo 

1 Jinit a, Sel, 0, -1, -1,0,n P1+2:P2+3 

2 |P1(1) B, SNAaSbSN4*]c]d], 0, -1, -1, 0, n|P2+4 

3 |P2(1) x, S.e], 0, -1, -1, 0, n S1- 

4 |P2(2) B, SNA aSbSN4*]c]d], 0, -1, -1, 0, n|S1+5 
a 5 |S1(4) B, SN4a.SbSN4*]c]d], 0, -1, -1, 1, n|P1+6:P2+7 
a 6 |P1(5) B, SNAaSbSN4*]c]d], 1, -1, -1, 1, n|P2+8 
a 7 |P2(5) B, SN4aS.bSNA*Jc]d], 1, -1, -1, 1, n|S1- 
a 8 |P2(6) B, SNA aSbSN4*]c]d], 1, -1, -1, 1, n|S1+9 
aa 9 |S1(8) B, SN4a.SbSN4*]c]d], 1, -1, -1, 2, n|P1+10:P2+11 
aa 10 |P1(9) B, SN4aSbSN4*]c]d], 2, -1, -1, 2, n|[P2+12 
aa 11 |P2(9) B, SNAaS.bSN4*]e]d], 2, -1, -1, 2, n|S1+13 
aa 12 |P2(10) |p, SN4.aSbSN4*Jc]d], 2, -1, -1, 2, n|S1- 
aab 13 |S1(11) |B, SNAaSb.SN4*]c]d], 2, -1, -1, 3, n|P2+14 
aab 14 |P2(13) |p, SNAaSbSN4 *Jc]d], 3, -1, -1, 3, n{/P3+15:P3+16 
aab 15 |P3(14) Ja, S.e], 3, -1,-1,3,n S1- 
aab 16 |P3(14) |B, SN4aS.bSN4*]c]d], 3, -1, -1, 3, n|S1+17 
aabb 17 |S1(16) |B, SN4aSb.SN4*]c]d], 3, -1, -1, 4, n|P2+18 
aabb 18 |P2(17) |B, SNAaSbSN4 *Jc]d], 4, -1, -1, 4, n{P3+19:P3+20 
aabb 19 |P3(18) |, S.e], 4, -1,-1,4,n S1+21 
aabb 20 |P3(18) 18, SN4aS.bSN4*]c]d], 4, -1, -1, 4, n|S1- 
aabbe 21 |S1(19) jw, Se.], 4, -1, -1,5,n C1(21+18)+22 


aabbe 22 |C1(21,18)|8, SNAaSbSN4* Jc]d], 4, 4,5,5,n |C2(22+17)+23 
aabbe 23 |C2(22,17)|B, SNAaSbSN4*].c]d], 3, 4,5,5,n |S1+24 
aabbec —_|24 |S1(23) 6, SNAaSbSN4*Jc.]d], 3, 4,5, 6,n |C1(24+14)+25 
aabbec = |25 |C1(24,14)|B, SNAaSbSNA* Jc]d], 3, 3, 6,6,n |C2(25+13)+26 
aabbec —|26 |C2(25,13)/6, SNAaSbSN4*].c]d], 2, 3, 6, 6,n |S1+27 
aabbece |27 |S1(26) |B, SNAaSbSN4*]c.]d], 2, 3, 6, 7,n |C2(27+9)+28 
aabbece |28 |C2(27,9) |B, SNAaSbSN4*]c].d], 7,n |S1+29 
aabbeccd |29 |S1(28) |B, SNAaSbSN4*Jc]d.], , 8,n |C2(294+6)+30 
8 
8 


1, 3, 6 
1, 3, 6 
aabbeced |30 |C2(29,6) |B, SN4AaSbSN4*Jc]d]., 1, 3, 6, 8,n |A(30+24)+31 
aabbeccd |31 |A(30,24) |B, SNAaSbSN4*]c.]d], 1, 4,5, 8, t |C2(31+5)+32 
aabbeccd |32 |C2(31,5) |B, SN4AaSbSN4*]c].d], 0, 4,5, 8,n |S1+33 

aabbeccdd|33 |S1(32) |B, SNAaSbSN4*Jc]d.], 0, 4,5, 9, n |C2(33+2)+34 
aabbeccdd|34 |C2(33,2) |B, SN4AaSbSN4*]c]d]., 0, 4,5,9,n |A(B4+21)+35 
aabbeccdd|35 |A(34,21) |a, Se.], 0, -1, -1, 9, t C2(354+1)+36 
aabbeccdd|36 |C2(35,1) |, Se]., 0, -1,-1,9,n recognized 


Table 5. Chart C after the recognition of the TAG Gp and the input string aabbeccdd 
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Complexity considerations for the TAG (LTAG) recognizer algorithm 


The presented algorithm has the worst case time complexity O(|A|-|AU1| - 
M -n°) where |A| is the number of auxiliary trees, |A U I| is the number of 
elementary trees, M is the maximum number of nodes in an elementary 
tree and n is the length of the input string. In LTAGs, we can select from 
the whole grammar only those trees that have lexical anchors in the parsing 
sentence so we can dramatically reduce the number of elementary trees used 
in the parsing process. The case complexity is worst during the ADJOIN 
operation. As we can observe in Table 3, during the adjoin operation we 
have to combine two items (#1, ..., i, j, k, 1, nil) and (#2, ...,j, p, q, k, nil). 
Therefore we have at most n° instances of indices (i, j, k,l, p, q) and we can 
call the adjoin operation |A| - |A U 1| - M times. 


3.4 Evolutionary algorithm for tree adjoining grammar parsing 


We can find an introduction to Evolutionary Algorithms and their applica- 
tions in [2] and [3] and a very good theory overview in [1]. Grammatical 
Evolution (GE), proposed by a group from the University of Limerick [13], 
combines aspects of Context Free Grammars with the searching capabili- 
ties of Evolutionary Algorithms in order to evolve high-level languages. GE 
orders the productions for every non-terminal in a CFG and then uses the 
gene values in order to decide which production to use when it is necessary 
to expand a given non-terminal. 

GE solves two main problems. First, we might have an invalid gene value 
when we want to apply a production number to expand the non-terminal. 
In this case we consider the gene value mod maxValidValue: that is for the 
given non-terminal we consider the gene value modulo the maximum pro- 
duction number for the given non-terminal. The second problem is what 
happens when we used all the genes and we still have non-terminals to ex- 
pand. Then GE proposes to start to use the string of genes once again from 
the beginning. 

Applying a similar technique in a TAG Evolutionary Algorithm we can 
construct a derivation tree and hence a derived tree whose yield matches on 
a given input string. 


Basic aspects of TAG (LTAG) evolution 


Suppose that we have a TAG = (N, T, I, A, S) and a given input string. 
We want to find a derived tree that starts with S and whose yield matches 


; TRIANGLE 6 e December 2011 


Applications of Evolutionary Algorithms in Formal Languages 53 


a given input string. Starting from an arbitrary S-type tree, we may apply 
substitutions and adjoins to develop a derived tree. We stop the searching 
process when the yield of the derived tree matches on a given input string. 
As presented, the searching process is exponential, and at every step there 
are several possible options to choose from. In fact, from the beginning we 
may choose from several S-type trees, so in the derived tree we may choose 
from several nodes to apply the next derivation and once a node has been 
chosen, we may have several possible trees to substitute or to adjoin in the 
given node. 

Evolutionary Algorithms’ individuals represent solutions for a given 
problem. The most complicated problem is to represent a derived tree in 
the TAG formalism using a fixed number of genes. 

Suppose that we have the TAG = (N, T, I, A, S) and there are |1| initial 
trees and |Is| initial S-type trees. We order all the trees in the sets I and A 
and all the nodes in every tree according to the node position in the string- 
tree representation. Thus the tuples (tree number, node number) completely 
characterize all the nodes in all the trees. We start to build a derived tree and 
we carry on in the derived tree the nodes’ attributes such as “substitution 
node” or the adjoining constraints. Now we can start to build the derived 
tree. 

We use the first gene mod |Is| to select the starting tree from the initial 
S-type trees. 

This initial tree will develop the derived tree. 


1. We repeat the algorithm’s steps until the length of the yield of the de- 
rived tree will be greater than or equal to the input string. If we finish 
the genes during this process, we start to use the string of genes from 
the beginning again. We count the non-terminals that do not have the 
{NA} constraint in the derived tree and let ning, be the maximum num- 
ber of the non-terminal node after the counting operation”. We use the 
next gene mod nNinax to select the next node where we apply a derivation 
step 

2. e Ifthe selected node is a substitution node then we count the trees that 

could be substituted in our node. Let 18,2, be the maximum number 
of a substitution tree. We use the next gene mod 18;q; to select the 
next substitution tree and after performing the substitution, we go 
to the step 1 


? If we have Obligatory Adjoining constraints, they must be satisfied first. 
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e If the selected node is an adjoin node then we count the trees that 
could be adjoined in our node, let naingy be the maximum number 
of an adjoin tree. We use the next gene mod najngx to select the next 
adjoin tree and after performing the adjoining, we go to step 1. 


We can optimize the usage of genes and whenever we have a single option 
for the next operation like a single tree or a single node to choose from, we 
can perform the operation without consuming the gene. 
We present the algorithm that describes the genetic decoding. We give 

only the adjoin part, because the substitution is similar. 

1) i=0 {counter for genes index} 

2)  evolvedTree = initialTrees|gene|i| mod |Is|| 

{Is is the set of the initial S-type trees } 
3) do while len(yield(evolvedTree)) < len(is) {is : input string} 


4) i=(i+1) mod ng {ng : number of genes} 
5) Nmax = |{internalNodes in the evolvedTree}|— 
| {internalNodes with NA attribute in the 
evolvedTree }| 
6) adjNode = nonTerminalCandidates|gene|i] mod nmax| 
7) i=(i+1) mod ng 
8) adjSet = adjNode.Label — type auxiliary trees 
9) Namax = |adjSet| 
10) i=(i+1) mod ng 
11) insertedTree = adjSet|gene|i] mod namax| 


{pl is the position of the adjNode 
in the evolved tree} 

{p2 is the position of the corresponding 
closedBracket in the evolved tree of the adjNode} 


12) t1 = evolvedTree.substring(0, p1) 

13) 2 = insertedTree.split(“*’')[0| 

14) {the left part of the insertedTree, 
15) until the foot symbol “x’’} 

16) {3 = evolvedTree.substring(p1 + 1, p2) 
17) t4 = insertedTree.split(“*’’)(1] 

18) {the right part of the insertedTree, 
19) after the foot symbol “«'} 

20) {5 = evolvedTree.substring(p2) 

21) evolvedTree = t1+1#2+134+t4+15 
22)  enddo 
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In the “Running Examples” section we show how the decoding function 
works. 


Fitness function complexity 


The fitness function assigns values to individuals developed by the Evolu- 
tionary Algorithm. It is the most important factor that directs the searching 
process of the Evolutionary Algorithms. Therefore, a fitness function that 
says “yes” or “no” to the individuals of an EA is completely useless for the 
searching process, because the EA cannot know if a new individual is a little 
bit better or worse than another individual. 

In our algorithm the characters in the input string and in the yield of the 
derived tree must match and the two strings must be of equal length. 

We can use several types of fitness function. The fitness function can 
take values on N°, the first value representing the maximum length of 
a sequence of matched characters, the second value being the number of 
matches, and the third value having negative values for yields longer than 
the input string. When we compare different individuals during the selec- 
tion process, the first criterion is the most important, then the second and 
then the third. 

Let M be the number of generations after which we stop the evolution 
of the TAG. We assume that the time complexity of the crossover, mutation 
and selection operators is less than the fitness function evaluation complex- 
ity. Under these circumstances we can say that our algorithm has a time 
complexity of O(M- (u+ A) - Time(ff)), where pz is the population size in 
the algorithm, A represents the number of children and Time(f f)) is the time 
complexity of the fitness function. During our tests we started with O(n?) 
complexity for the fitness function, which we then reduced to O(n?). But 
the results were best with a linear fitness function. 


Running examples 


We adapted the string-tree notation to simplify the internal representation 
of the trees. We used curly brackets to specify the constraints, rectangular 
brackets to specify the nodes subordinated to a nonterminal and a blank 
separator after terminals. We considered that nonterminals start with an 
uppercase character, the strings can be as long as required, constraints are 
included and finally a “]” is used to indicate the end of the nonterminal 
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representation. Inside a balanced pair (“[”, “]”) we have all the children of 
the nonterminal. A foot node of an auxiliary tree has no children so the 
pair(“[”, “]”) is not necessary and instead, we have only the foot marker 
that is the ”*”. 

For tests we used the grammar TAG G = (N = {5S}, T = {a}, I] = 
{a1 : S{na}[a S[a ]], a : S{na}[b S[b ]]}, A = {Bi : S{na}|a S[S{na} «a |], 
Bo : S{na}[b S[S{na} « b ]]}, S) that generates L(G) = {wwlw € {a, b}*}, 
known as the copy language. In order to explain the decoding algorithm 
better we illustrate it on an example. For the grammar mentioned above 
and for an input string is =“aaaabbbabbaaaabbbabb”, len(is) is 20 and the 
number of initial S — type trees |Is| = 2. Suppose that we have the follow- 
ing string of genes: 113, 110, 248, 173, 119,.... According to the decoding 
algorithm instruction no. 2), gene[0]=113, the evolvedTree is “S{NA}[b S[b 
||”. The length of the yield of the evolvedTree is less than 20 and the algo- 
rithm will continue with the cycle from the third instruction. Next ninax is 
1 and here due to the optimization of genes’ usage we do not increment 
the gene counter as described in the fourth instruction of the algorithm. The 
adj Node position is 8 (string index starts from 0), adjNode.Label is “S”, namax 
is 2, gene[1]=248, the insertedTree is “S{na}[aS[S{NA}*a]]”. We have p1 = 8, 
p23 13,11 S="SINA} |b", 12 ="“S{NA} a SSI NAM, B=" |b | 44S "o 
||", 5 =“), evolvedTree = “S{NA}[b S{NA}|a S[S{NA}[b Ja]]]”. 

The evolution cycle continues and we get the following evolvedTrees: 
evolvedTree = “S{NA}[bS{NA}[aS{NA}|a S|[S{NA}[S{NA}[b Ja ]a }]]]" 
evolvedTree = “S{NA}[b S{NA}|a S{NA}|aS{NA}|b S[S{NA} [S{NA} 
(S{NA} [b]a]a Jb II", 
evolvedTree = “S{NA}[bS{NA}[aS{NA}|aS{NA}|b S{NA}[b S{NA}[bS 
{NA} [aS{NA}|aS{NA}|aS{NA}[aS[S{NA}[S{NA}|[S{NA}[S{NA} 
IS{NA}IS{NA}IS{NA}IS{NAFIS{NA}(b ]a Ja Jb] ] b Ja Ja Ja Ja |IIINII". 

First we studied the behavior of the EA for several input examples, and 
then we tried to compare the results with the classical parsing algorithm. 

For the general behavior of the EA when it solves TAG parsing, we tested 
two input strings, “aaaabbbaaaaabbba” and “aaaabbbaabbaaaaabbbaabba” 
with lengths 16 and 24, respectively. 

We used an evolutionary algorithm with 15 individuals as the popula- 
tion size. Each individual had 20 genes with values between 0 and 255 (one 
byte). We managed to simplify the fitness function because we stopped the 
evolution of the derived trees as the length of the yield was equal to the 
length of the input string for our particular grammar. We also considered 


x 


a 
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Input string 
aaaabbbaaaaabbba aaaabbbaabbaaaaabbbaabba 

gen. |Max |Average |Min |gen. |Max Average | Min 
0 7 3.27 2 0 5 2.67 1 
1 7 6.27 6 1 8 6.33 5 
2 16 7.13 6 2 8 6.93 6 
3 16 7.40 6 3 12 8.20 7 
4 16 7.87 7 4 12 8.73 7 
5 16 8.67 7 5 12 9.07 8 
6 16 11.20 8 6 12 9.20 8 
7 16 11.20 8 7 12 9.53 9 
8 16 11.20 8 8 12 9.73 9 
9 16 12.80 8 9 24 11.80 9 
10 16 15.47 8 10 24 11.93 9 


Table 6. Results of tests of EAs for TAG parsing 


the fitness function to be the maximum length of matching characters be- 
tween the input string and the yield of the derived tree considered from the 
beginning and the end of the strings. 

We present the results of the runs in table 6, where gen represents the 
generation number, is is the input string, Max represents the best fitness 
function of an individual during one generation, Average is the average fit- 
ness function of individuals during one generation and Min is the minimum 
fitness function of an individual during one generation. 

Theoretically the classical algorithm for parsing has the worst case com- 
plexity O(n°). The problem is that for many examples the classical algo- 
rithm does not reach the worst case and we believe that more important 
than comparing the results for two algorithms would be to compare the 
results for an average behavior. On the other hand, even if the EA uses a 
linear fitness function, the number of generations multiplied by the num- 
ber of individuals in the population could lead to a considerable volume of 
computations while solving a parsing problem. 

To compare the classical parsing algorithm with the EA we used a an 
empirical method to measure the number of computations. In every cy- 
cle we incremented global variables called computations. We estimated the 
number of computations for both the classical algorithm and the EA us- 
ing the same input string. We also used other comparative methods such 
as measuring the time it took to find the solution. The only problem was 
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that we implemented the algorithms in two different programming environ- 
ments (VBA and Java) and the running time would have been influenced by 
other aspects, not only by the complexity of algorithms. Therefore we again 
used two input examples whit lengths of 16 (“aaaabbbaaaaabbba”) and 20 
(“aaaabbbabbaaaabbbabb”). For the classical algorithm we needed only one 
run to determine the number of computations for an input example, while 
for EA we considered the average result after 10 tests. The results are pre- 
sented synthetically in table 7. 


| First example len(input)=16 |Second example len(input)=20 
| Computations Computations 
| classical 827787.0 2153088.0 
| evoAvg 268886.3 661745.5 


Table 7. Comparative tests for classical and EA TAG parsing 


4 Post running analysis 


Let us recall steps 12) to 21) from the algorithmic description of the adjoin 


operation. 
12) t1 = evolvedTree.substring(0, p1) 
13) 2 = insertedTree.split(“*’")(0| 
14) {the left part of the insertedTree, 
15) until the foot symbol “x’’} 
16) {3 = evolvedTree.substring(p1 + 1, p2) 
17) t4 = insertedTree.split(“*’)(1] 
18) {the right part of the insertedTree, 
19) after the foot symbol “«} 
20) {5 = evolvedTree.substring(p2) 
21) evolvedTree = t1+1#2+13+#4+15 


We can see in Figure 5 that the adjoin operation is similar to something 
like a synchronized pumping of two strings, the left side and the right side 
of an auxiliary tree with respect to the foot. 

We also know that TAGs have some limitations in the sense that the 
language a"b"c"d"e" is not a TAL. Considering the previous observations, 
we define two formalisms that can extend the TAG formalism. 
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x 
Y 
Ny, 
(a) —————_—> () 
tl 3 tS “ 
t2 


(B) 
t4 tl tS 


Fig. 5. Synchronization in TAG formalism 


Notations: Ng = NUN, N = {A | A € N}, Dnt is the Dyck language 
over N, “enriched” with terminals (see its grammar G below), which we call 
bracketed language. 


G=(S,NegUT,P,S), 

P={S>A,S > SS}U{S > a|aeET}U{S > aSa | ae N}. 
Definition 9. A step-synchronized rewriting system (SSR) is a quadruple G = 
(N, T, P, S), where N is a finite set of non-terminal symbols, T is a finite set of 


terminal symbols (NT = @), S € N is a starting symbol, and P is a finite set of 
rules that are tuples of context-free productions. 


Example 2. Consider the following system 


G, =({5,4,8,C,D} 440,000 f, 2,1), 2,5), 
where P = {(S — ABCD), 
(A > aAb,C — cCd),(A > ab,C - cd), 
(B + eBf,D — gDh),(B— ef,D — gh)}. 


For any i > 0 we define the sets N; = {A | A € N} (intuitively, 
symbols added at step i of a derivation), Vj = N;U T, and a homomorphism 
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hi : V — V; defined by hj(a) = a for all a € T and hj(A) = A“ for all 
A €N. We also define N, = {A; | A € N,i > 0} and V, = N, UT. 

Configurations of the systems G are represented by a string in V,;* and a 
number. The starting configuration of G is (S),0). We say that a configu- 
ration (w,j) directly derives (w’, j’) (denoted as ae iff j’ = j+1and wand 
w' can be represented in the eae way: 


w= w Aw, wpa? wes, 
w! = wyhjy1 (x1) We - + - wehjer (Xk) We, 
(Ay > xX1,°°+, Ag — Xx) Ee P. 


The derivation relation (=*) is defined as a reflexive and transitive closure 
of =. The generated language is defined as L(G) = {w € T* | (S,0) >* 
(w,n)}. 


Example 3. For the system Gj, consider the derivation: 

(sO, 0) => => (AMBYCMDM1) =* 

(a ata (m+ pm BAU ome(m+) gmp) im +4 1) =* 

(a ata (nebl) pit gn plata tT) prom Clmetl) gi gt pimtnt+) pt m tnt 1) = 
(a qmtipmtl, nB(mtntl) prom+lgm+lonp(mtntl) pH a +n +2) = 

(a gttlpm+l, fil pital omeek ape rl gtttd el tn+ 3) (see Figure 6). 


The synchronization relation is represented by connecting the synchronized 
nonterminals in the sentential form. 


— a 
(a A" +1) piren peter) pn cle rl) gm etn (m+n-+1) yn m+tn+t 1) 
u_ sd 


Fig. 6. For Example 3, the synchronization is by the derivation step. 


We define SSRS, as the set of all step-synchronized rewriting systems 
G = (N,T,P,S), where P C (N x V*)* (at most k productions in all rules). 
We write SSRS, = Ugs1 SSRSy. 
Observation: L(SSRS;) = CF. 
Observation: {aj --- a5, | n > 1} € L(SSRS;,). 


Definition 10. A bracketed-synchronized rewriting system (BSR) is a 4-tuple 
G = (Nz, T, P, S), where Ng is a finite set of bracketed non-terminal symbols, T 
is a finite set of terminal symbols (Ng NT = ©), S € Dy, isa finite set of starting 
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axioms and P is a finite set of rules that are pairs of context-free productions of the 
form (A + w1,A > wz), where wyw2 € Dnt. 


We say that w directly derives in w’ (denoted as w => w’) iff w and w’ 
allow representation: 


w= w Aw rAw3, W2€ Dnt, 


w! = W1X{W2x2W3, (A 4 x4,A 4 x2) € P. 


Observation: In each derivation starting from an axiom, every sentential 
form is in Dy,7. The derivation relation and the language are defined as 
usual. BSRS is the set of BSR systems. 


Example 4. For the system G2 = ({A, A}, {a,b,c,d},P, AAAA), P={(A> 
aAb, A - cAd), (A — ab, A > cd}), consider the following derivation: 
AAAA =>* Aa"—1Ab"-1c"-1lAgr-14 =* 

qi LApm 1gn 1 Ap" Ign 1Aq" 1om 1Aq™ 1 = 
a™-lApmlgrpncengnem—l Agm-l = gmpmgnpnengncomgm (see Figure 7). 


qt 1 Ap 1yn 1 Ap” Ign VA qn 1m 1,4 qm 1 
a | 
| 


Fig. 7. For Example 4, the synchronization is by the paired brackets: the nonterminals 
are synchronized if the substring between them is in the bracketed language Dy, r. 


Theorem 1. L(BSRS) C L(SSRS3). 


Proof sketch. Given a BSR G = (Nz,T,P,S) we construct a SSR G! = 
(N’,T,P’,S’), where P’ = {(A; > c(x1), Aj  c(x2)) | (A 4 1,4 > x2)}. 


Observation: An N, T-tree can be represented as a string by the mapping 


str : Tn(T) > (Ng UT)*, defined as 
str(ta()) =a, a€ TU {A} (a single-node tree labeled a), 
str(ta(t1,--- ,tm)) = Astr(ty)---str(tm) A, AEN 
(a tree labeled A with subtrees t1,--- ,t1). 


Likewise, words w € Ry 7 can be mapped into N, T-trees by the mapping 
tree: Ry t > Tn(T) 
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a(), a € T, 

A(tree(w1),--+- ,tree(Wm)), AEN, 
W1,°*+ ,Wm © Rn,r, m = 0, 

Ry,t = TU sey APA 

(Ry,r = “rooted” bracketed language). 


tree(a) 


=t 
tree(Aw,:--WmA)=t 


Observation: We recall that the foot of any auxiliary tree has an NA at- 
tribute. Without restricting the generality ((yield) language family), we also 
assume that the head labeled B of any auxiliary tree has an NA attribute 
(otherwise replace such a tree t by the tree tY4(t)). 


Theorem 2. L(TAG) = L(BSRS). 


Proof sketch. 
C For a TAG G = (T,N,I,A,S) we construct the equivalent BSR system 
G' = (T,Nz, P,S') with S' = {h(str(t)) | t € I} and P = {(B > h(x1),B > 
h(xz)) | t = tp(t,-++,tm) € A: str(t) = BN4x,BN4x2BNA} 
> For a BSR G’ = (T, Nz, P,S’) we construct the equivalent TAG 

G = (T,N,1,A,S) where S is a new symbol, 

I = {tree(SN4wS4 | we S')}, 

A = {tree(BN4h® (x,)BNAh® (xy) BN4) | (B > x1,B > xp) € P}, 

h°(B) = B°4, BEN, h?(a) =a, aT. 


Every derivation in G’ corresponds to a derivation in G and vice-versa, so 
L(G) = L(G) 


The notion of BSR can be extended, considering tuples instead of pairs 
of bracketed symbols, using the extended language Dir (see Figure 8), 
defined by the grammar below 


Gk = (S,NKUT,P,S), NE={AM®| AEN, 1<i<hj, 
P={S 5A,S > SS}U{S >a|aeET}U 
{§ > SAMs.--SA“S| Ae N}. 


We can define N5* where the bracketed relation has at most k items. 
We define BSRS; as the set of all bracket-synchronized rewriting systems 
G= (NS, T,P,S). We write BSRS, = Ups BSRSy. 

Observation: {a7 ---a%, | n > 1} € L(BSRSy,). 
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AY AQ A@ AD BO) BR pO) AZ AM 4) 43) 43) BO Bp ps) 
a | Ld Ld a ee 
Po 
Fig. 8. For the extended BSR systems we show an example of the ternary synchro- 
nization. 


5 Possible extensions and further research 


After extending the BSR formalism we get a formalism that is very similar 
to Coupled Context-Free Grammars, or Klammergrammatiken [16]. We also 
find the step synchronized rewriting proprety defined as locality by Owen 
and Satta in [14]. Using Recursive Matrix Systems (RMS) [10] we can also 
simulate the derivations in SSRS. 

Future research could go in several directions. First we could try to find 
a connection between EAs for TAG parsing and DNA computing. 


Hage 


Fig. 9. LoopDirect (ld) operation in gene assembly in Ciliates 


Both domains, EAs and DNA computing, need to code solution some- 
how for a given problem. 

If we look at the process of gene assembly in Ciliates (we can find several 
Formal Frameworks described in [9]) we observe that parts of the DNA 
act as pointers as we can see in Figure 9 where MDS means Macronuclear 
Destined Sequences and IES is Internally Eliminated Sequences. 

We believe that there could be a connection between the non-Terminals 
in BSR and non-overlapping pointers in self assembling in Ciliates and if 
we could do a synchronized instead of the elimination operation insertion 
as in Figure 10 we would obtain the adjoin operation as in Figure 5. 
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Fig. 10. Synchronized insertion in DNA molecules 


As a second research direction we could try to simulate the BSR systems 
without rewriting rules, only adding contexts using Contextual Grammars 
[15]. 

Also, for future developments, we will run the EAs algorithms for more 
complex grammars including natural language 

Parsing using the English Grammar available in the XTAG Project [19]. 
We will also try to guide the searching process of the EAs by using some sta- 
tistical information. In order to solve the negative examples problem when 
using EAs, we will combine the classical parsing algorithm with the EA in 
a concurrent manner. We will thus be able to use the result of the algo- 
rithm that arrives first to a conclusion. In the final phase of our research 
we will focus on online tests and comparative results for natural language 
parsing. Several theoretical results regarding the probability of not finding 
the solution during one generation of the EA are also expected. 


6 Concluding remarks 
We have proposed an Evolutionary Algorithm for Tree Adjoining Grammar 


parsing. We can observe that the classical parsing algorithm needs approxi- 
mately 3 times more computations than the EA to solve the same problem. 
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One of the disadvantages of the EA parsing algorithm is that, for nega- 
tive examples, the EA will not be able to say that there is no solution. We 
may not have let the algorithm run enough generations, but we ran some 
tests for positive examples and we approximated the requested number of 
generations required to find a solution for a certain length of the input 
string. 

Finally, one interesting aspect of the EA parsing algorithm is that if the 
grammar is ambiguous, in one run of the same input string we found dif- 
ferent parsings for the different individuals in the population. 

The string-representation for trees representation could be a starting 
point for developing new and more efficient TAG parsing algorithms. 

We believe that our algorithms are a starting point for developing new 
models for knowledge-based representation systems, automatic text sum- 
marization etc. 
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Summary. This article brings together some rather powerful results on P systems 
in which the computation is performed by the communication of objects through 
symport and antiport rules considering the trace of an object through membranes, 
on the one hand, and by P systems with object-rewriting non-cooperative rules, 
promoters/inhibitors at the level of rules and only one catalyst, on the other. It is 
recalled here that computational universality can be reached whit these formalisms 
and that some of the proofs can be sketched. Three ideas are also put forward to 
brake the direct relationship (infinite hierarchy) induced by the size of the consid- 
ered alphabet and the number of the membranes needed in a P system (with traces) 
to generate recursively enumerable languages on the chosen alphabet. 


1 Introduction 


The present study focuses on P systems with a purely communicative func- 
tioning from two different perspectives: the “classical" one, in which the 
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result of the computation is the (number of) objects collected in a specified 
membrane, as introduced in [6]; and a “non-classical” one in which the re- 
sult of the computation is a trace of a certain object (that is, the string of 
labels of the membranes visited by this object, as introduced in [20]). 

Symport/antiport phenomena are inspired by the biological process in 
which two molecules pass together, simultaneously, through a membrane, 
in the same direction (symport), or in opposite directions (antiport). For 
further biochemical details the reader is asked to see [1] and [2]. Technically, 
the rules used in P systems as models of these biological processes are of 
the form (x,in) and (x,out), as models of symport, and (x,out;y,in) as a 
model of antiport, where x,y are strings of symbols representing multisets 
of chemicals. Of course, this is a generalization of what happens in biology, 
where mainly pairs of chemicals are coupled. Several classes of P systems 
of this type were considered in [4], [6], [7], etc. 

The other formalism we recall here is the one of P systems with pro- 
moted/inhibited rules, which also has a strong biological motivation. More 
precisely, a promoter is a chemical within a living cell which makes a reac- 
tion happen only in its presence. The inhibitor is the opposite: the reaction 
can not take place if a certain chemical is present in the cell. These biologi- 
cal considerations have been formalized in [12] as u > v|,(u evolves to v in 
the presence of the promoter a), and u — v|_, (u cannot evolve to v if the 
inhibitor b is present in the same membrane region). 

For the reader’s convenience, we recall the fact that P systems are dis- 
tributed parallel computing models which abstract from the structure and 
the functioning of the living cells. In short, we have a membrane structure, 
consisting of several membranes embedded in a main membrane (called 
the skin) and delimiting regions where multisets of certain objects are placed 
(Figure 1 illustrates these notions); the objects evolve according to given evo- 
lution rules, which are applied nondeterministically (the rules to be used and 
the objects to evolve are randomly chosen) in a maximally parallel manner 
(in each step, all objects which can evolve must). The objects can also be 
communicated from one region to another. In this way, we get transitions 
from one configuration of the system to the next. A sequence of transitions 
constitutes a computation; with each halting computation we associate a result, 
the number of objects from a specified output membrane. 

Details can be found at the web address http: //bioinformatics.bio. 
disco.unimib.it/psystems. 
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Figure 1: A membrane structure 


One particular case of the abovementioned evolution rules is that of cat- 
alytic rules, which model the biological reactions that can take place only 
with the help of certain enzymatic proteins (which participate in reactions 
and remain unmodified after they occur). Another important type is pro- 
moted/inhibited reactions, which take place in the presence/absence of cer- 
tain chemicals that are not directly involved in reactions. 

In this abstract, symbolic, mathematical framework it is interesting to see 
what the computational power is when “low" cooperation features are used. 
In this regard, as was shown in [14], P systems with non-cooperative and 
catalytic rules and only two distinct catalysts are computational universal. 
Also, in [12] a model with non-cooperative rules, one catalyst and promoters 
at the level of rules is shown to be universal. 

In this paper we emphasize the computational power of the systems 
with non-cooperative rules, catalytic rules with one catalyst and promot- 
ers /inhibitors in both generative and accepting cases. We will also mention 
the latest results regarding the computational power of the “traced" variant 
of P systems. 
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2 Preliminaries 


The language theory notions we use here are standard, and can be found, 
for instance, in [10], [11]. We only mention that we denote by V* the free 
monoid generated by an alphabet V; A is the empty string, |x| is the length 
of x € V*, and |x|, is the number of occurrences of the symbol a € V in 
the string x € V*. For x € V* we denote alph(x) = {a € V | |x|qa > 1} 
(the set of symbols appearing in x), and for a language L C V* we write 
alph(L) = Uxez al ph(x). By REG, CF, CS, REC, RE we denote the families of 
regular, context-free, context-sensitive, recursive, and recursively enumer- 
able languages, respectively. 

In the (sketches of the) proofs of the theorems presented here, we used 
tools such as Regulated Rewriting (more precisely the fact that the families 
of languages generated by regularly controlled context-free grammars with 
appearance checking and erasing rules are equal in generative power to 
the family of all recursively enumerable languages over the same alphabet), 
and the generative power of Register Machines (which are computationally 
universal). Those readers who are familiar with these concepts can skip the 
first two subsections. 


2.1 Regulated rewriting 


In any Chomsky grammar, at some given step in a derivation one can use for 
rewriting any applicable rule in any desired place of the sentential form. In 
order to restrict this nondeterminism some regulating mechanisms, which 
can control the derivation process, were considered. Using such regulations 
we can reach computational universality even if we use context-free gram- 
mars as a core generative device. In the literature there are many types of 
regulations which restrict the use of rules in a Chomsky grammar (see [3], 
[18]). Here we will present only regularly controlled grammars with appear- 
ance checking and A-rules. 


A regularly controlled context-free grammar with appearance checking is a 6- 
tuple G,c = (N,T,P,S,R,F) where N,T,P, and S are specified as in context- 
free grammars, R is a regular language over P, and F is a subset of P. 

For a rule p= Aw € P and x,y € Vé we write x =>; y if either 


1. x =x, Ax and y = xywWXx2, or 
2. x =y, A does not appear in x, and p € F. 
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The language L(G) generated by G with appearance checking consists 
of all words w € T* such that there is a derivation 


ac — 
pn On = W 


S 5, W1 py, W200 
with pip2---pn € R. 

By L(ArCac) we denote the families of languages generated by regularly 
controlled grammars with appearance checking and erasing rules. The fol- 
lowing result stands: 


L(ArCac) = £(RE) 


where by £(RE) we denote the family of all recursively enumerable lan- 
guages over the same alphabet T. 


2.2 Register machines 


The power of Minsky’s register machine [16] was also used in some of the 
proofs of the theorems that follow, which is why we recall this notion here. 
This machine runs a program consisting of numbered instructions of several 
simple types. Several variants of register machines with a different number 
of registers and different instructions sets were shown to be computationally 
universal (see [16] for some original definitions and [15] for the definition 
we use here). 


A n-register machine is a construct M = (n, P,i,h), where: 


e nis the number of registers, 

e Pisaset of labeled instructions of the form j : (op(r),k,1), where op(r) is 
an operation on register r of M, and j,k,1 are labels from the set Lab(M) 
(which numbers the instructions in a one-to-one manner), 

e iis the initial label, and 

e his the final label. 


The machine is capable of the following instructions: 


(add(r),k,1) : Add one to the contents of register r and proceed to in- 
struction k or to instruction /; in the deterministic variants usually consid- 
ered in the literature we demand k = 1. 


(sub(r),k,1) : If register r is not empty, then subtract one from its con- 
tents and go to instruction k, otherwise proceed to instruction /. 
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halt : This instruction stops the machine. This additional instruction can 
only be assigned to the final label h. 


A deterministic m-register machine can analyze an input (N1,..,Mg) € 
Np in registers 1 to w, which is recognized if the register machine finally 
stops by the halt instruction with all its registers being empty(this last re- 
quirement is not necessary). If the machine does not halt, the analysis was 
not successful. 


2.3 P systems prerequisites 


A P system (of degree m > 1) with symbol-objects and rewriting evolution 
rules is a construct 


Il= (V, C, H,W4,.--,Wm, (Ri, 1), sey (Rin, Pm), io), 


where: 

e V is the alphabet of IT; its elements are called objects; 

e CC Vis the set of catalysts; 

e pisamembrane structure consisting of m membranes labeled 1,2,--- ,m; 
e wj,1<i<™m, specify the multisets of objects present in the correspond- 


ing regions i at the beginning of a computation; 

e R;,1 <i < m, are finite sets of evolution rules over V associated with 
the regions 1,2,...,m of yu, and p; is a partial order relation over R; (a 
priority relation); these evolution rules are of the form a — v or ca — cv, 
where a is an object from V — C and 7 is a string over 


(V —C) x ({here, out, in}) 


(In general, the target indications here, out, in are written as subscripts 
of objects from V.); 

e ig is a number between 0 and m and specifies the output membrane of 
II (in case of 0, the environment is used for the output). 


Starting from the original model of P system, several variants were pro- 
posed (see [17]). One of them is P systems with promoters/inhibitors and was 
introduced, as mentioned previously, in [12]. In the case of promoters, the 
rules (reactions) are possible only in the presence of certain symbols. An 
object a is a promoter for a rule u — v, and we denote this by u — v|,, if the 
rule is active only in the presence of object a. An object b is an inhibitor for 
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a rule u > v, and we denote this by u — 0|~», if the rule is active only if 
inhibitor b is not present in the region. In particular, promoters /inhibitors 
themselves can evolve according to some rules. 

The difference between catalysts and promoters consists of the fact that 
the catalysts directly participate in rules (but are not modified by them), and 
they are counted as any other objects, so that the number of applications of 
a rule is as big as the number of copies of the catalyst, while in the case of 
promoters, the presence of the promoter objects makes it possible to use the 
associated rule as many times as possible, without any restriction; moreover, 
the promoting objects do not necessarily directly participate in the rules. As 
a consequence, it can be seen that the catalysts inhibit the parallelism of the 
system while the promoters/inhibitors only guide the computation process. 

The P system with the mentioned features starts to evolve from the initial 
configuration, as in the classical P system, to the final configuration. The 
result of the halting computation is the number of objects present in the 
region ig in the halting configuration. The set of all numbers constructed in 
this way by a system II is denoted by N(II). For this kind of P systems we 
will use the following notation: 


NOP (a, B),« € {ncoo, coo} U {cat, | k > 0}, B € {proR, inhR} 


to denote the family of sets of natural numbers generated by P systems 
with at most m membranes, evolution rules that can be non-cooperative 
(ncoo), cooperative (coo), or catalytic (cat;), using at most k catalysts, and 
promoters (proR) or inhibitors (inhR) at the level of rules. 

We may also consider the vector ¥ (w) as the result of the halting compu- 
tation (the vector of multiplicities of objects) where w is the multiset present 
in the region ig in the halting configuration. In this case, the set of all vectors 
constructed in this way by a system I is denoted by Ps(I1). We will also 
use the following notation: 


PsIP (a, B),« € {ncoo,coo} U {cat, | k > 0},B € {proR, inhR}, 


to denote the family of sets of vectors of natural numbers accepted by 
P systems with at most m membranes, evolution rules that can be non- 
cooperative (coo), cooperative (coo), or catalytic (cat;,), using at most k 
catalysts, and promoters (proR) or inhibitors (inhR) at the level of rules. 
Here, I stands for P systems with internal input. For generative devices we 
will write PsPy,(---,+--). 
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In this paper we will recall how the regularly regulated context-free 
grammars with appearance checking can be used to prove the computa- 
tional universality of this type of P systems.We will also present the deter- 
ministic P systems that accept sets of vectors of natural numbers. 


We also recall here the case in which the trace of a certain object is con- 
sidered, so we have P systems of the following form: 


II = (V, t, T,h, B,W1,. ie .,Wm,E,Ry,. ‘ -/Rm), 


where V is an alphabet, t € V (a distinguished object, “the traveler"), T is 
an alphabet, h : {1,2,...,m} —> TU {A} is a weak coding, w1,...,Wm are 
strings representing the multisets of objects present in the m regions of y, E 
is the set of objects present in arbitrarily many copies in the environment, 
and Rj,...,Rm are the sets of symport and antiport rules (with promoters 
or inhibitors) associated with the m membranes. The traveler is present in 
exactly one copy in the system, that is, |w ...Wm| = 1 andt ¢ E. 

Let 7 = Ci,Co...Cy,k = 1, be a halting computation with respect to IT, 
with Cy = (w1,...,Wm,A) the initial configuration, and C; = (29,...,29, 
zi) the configuration at step i,1 <i <k. If iz |4 =1forsome1l <j<™m, 
then we write C;(t) = j (therefore, C;(t) is the label of the membrane where 
t is placed). If [2 |e = 0 for all j = 1,2,...,m, then we put C;(t) = A. Then, 
the trace of t in the computation ¢ is 


trace(t,a7) = Cy(t)Co(t)...C,(£). 


The computation ¢ is said to generate the string h(trace(t,o)). Hence the 
language generated by IT is L(II) = {h(trace(t,7)) | 7 is a halting compu- 
tation in IT}. 

We denote by LPin(psymp, panti,) the family of languages generated by P 
systems with at most m membranes, with symport rules of weight at most p 
and antiport rules of weight at most q, using promoters; when the rules have 
associated forbidding contexts we write fsym, fanti instead of psym, panti; 
when the rules are used in the free mode (they have no promoter/inhibitor 
symbols associated), we remove the initial “p" and “f" from psym, panti and 
fsym, fanti. As usual, the subscript m is replaced by * when no bound on 
the number of used membranes is considered; similarly, if we use symport 
or antiport rules of an arbitrary weight, then the subscripts p,q are replaced 
with «. 


; TRIANGLE 6 e December 2011 


Symport/Antiport and Traces in P Systems 75 


3 Some relevant examples 


In this section we will present some examples of P systems computing some 
“sensitive” tasks using the types of P systems discussed above. First we will 
construct a P system with promoters that, having as input two values, say 0 
and/or 1, computes the and operation (see Figure 1). 


a rey 


0 = Oout| ar 
A’ Al" 
0 — 0 Aout 0 Alan 
0 > Oout c0! > COout A" +A 
1 Igie 131 BB 
V > 1" Bout 1 Alp 
2 cl” > clout B! — B"” 
2 1 > outlay 


cL BU +A oy, 


Fig. 1. Simulation of the AND gate using promoters and one catalyst 


Formally, we define the following P system 


Tlanp = (V,C, pi, 1, W2, W3, Ry, Ro, R3,0), 


where: 

* V={010,1 1) 4454 8,8 80) 
e C= {ce}; 

© #= (sll jiols: 

e W1 = W3 = ©, W2 = {c}; 

e Ry _ {1 = lout, 0 > Oous 


Ry = {0 3 0! Aout, cO’ + COout, 1 4 1" Bout, 1 > 1, 
cl” — Clout }; 

R3 = {A > A’',0 > 0outl ar, A’ @ AY, 0 > Alan, 
A” — A, B- B’,1 > Alp, BY > B", 
1-> lout| Br, B" + Abs 
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The simulation of the AND gate uses the catalyst c to inhibit the par- 
allelism and to separate the entrance time of objects 0 and 1 into region 3. 
Depending on the entrance time, objects will be either deleted, or sent out 
into the environment. More specifically, if we consider that initially we had 
two objects 0 inside region 2, the rule 0 + 0'Aout is executed. Its role is to 
introduce object A into region 3 to set up the “right" configuration of the 
region. Next, in region 2 the only applicable rule is cO0! + cOou, which will 
introduce one object 0 into region 3. At the same time, in region 3 the rule 
A — A’ is executed. Now, we will have in region 3 the objects A’ and 0, 
and the rules that will be applied are 0 > Oout|,4: and A’ > A”. These rules 
guarantee that an object 0 is sent out into the environment. In the meantime, 
in region 2, the remaining object 0’ reacts with the catalyst c and an object 
0 will be introduced into region 3 (the rule used is again c0’ — cOout). Here, 
object 0 will find a different context since now, in region 3 there is no object 
A’. Therefore, the rules 0 + A| 4” and A” — A are applied. Hence the initial 
configuration of the system is restored. Basically, a similar method stands 
for the other cases, with some minor changes: objects 1 enter into region 3 
with one computational delay (because of the rule 1 — 1’ present in region 
2) in order not to influence the processes executing in region 3; the first 
object 1 that enters into region 3 is deleted (as opposed to the above case 
when the first object 0 that arrives in region 3 is sent out) by using the rule 
1-> A Bi: 

Recall that membrane 1 can be entirely avoided, its role being only to 
specify the entry point of the input. Also, the result of computation is sent 
out into the environment even if it is actually obtained in region 3. These 
features are useful when we want to connect gates to circuits (see [13] for 
more details). 


The second example (see Figure 2) uses context-free rules, inhibitors and 
one catalyst to compute the arithmetic difference between the initial multi- 
plicity of two distinct objects, present at the beginning of computation into 
an input region. 

Formally, we define the following P system 


Tyir = (V, CG H,W1,W2, Ri, Ro, 2), 


where: 
e V= {a,b,a',b',d,A,B,c}; 
e C= {c}; 
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< 


c,a", bm 
ca — ca’ d|_q) 
cb > cb’d|_y 
a! Aout|-b 
b med Bout|—a 
dA 
1 a’ —> Alaa 
bl > Alaa 


\a A 


Fig. 2. Integer subtraction using inhibitors and one catalyst 


# = [oli jaje; 

wy = {c,a",b™}, wr =@; 

Ry = {ca > ca'd|_g, cb — cb'd|_y, a — Aout|-o, 
b! => Bout|-a, d > A, a! a Alaa b! 7 A\-a}; 

Rao. 


The system starts the computation by inputting into membrane 1 a cat- 
alyst c and the objects a”, b"”, whose multiplicity we want to subtract. The 
result of computation is sent to region 2 and it is represented by: 


A" ifn > m; 
e B"-Tifm>n; 
no object is sent to region 2 meaning that m = n. 


We will not give many details regarding the computations here (these 
can be found in [19]). We only mention that the system works as follows: 
while there are still objects a and 5b, they are deleted in pairs, iteratively, until 
there are no more objects a, for instance (or objects b). At that moment, the 
flow of computation changes and as a result, also iteratively, the remaining 
objects b (or objects a, respectively) are sent out. During the computation, 
the promoters control the derivation process, while the catalyst inhibits the 
parallelism. 


The third example (detailed also in [20]) shows how a P system consid- 
ering traces of an object is computing. Consider the system 


Il = ({d,t},t, {a,b,c},h,[ 1[2[ 3[ 4[5 ls la Js Jo Jyt, 
O,O,O, ©, {d}, Ri, Ro, R3, Ra, Rs), 
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with h(1) =a,h(3) = b,h(5) =c,h(2) = h(4) = A, and the following sets of 


Ry, = R3 = R5 = {(t,out), (td, in)}, 
RS Re 1), Chant. 


First, the traveler brings n > 1 copies of d from the environment (each of 
them immediately enters membrane 2), and then the traveler goes to mem- 
brane 2. Subsequently the traveler brings m <n copies of d into membrane 
3 (each of them immediately membrane 4), and then the traveler goes to 
membrane 4 (at this moment, it is possible that some copies of d remain in 
membrane 2). From membrane 4, all copies of d are carried into membrane 
5; the computation stops with the traveler in membrane 4. Thus, for any 
computation ¢ of this type, we have 


trace(t,7) = 1"*1(23)"(45)"4, for some n > 1,m <n. 


The traveler can also end up in membrane 2, after introducing all copies 
of d in membrane 3, and returning to membrane 2. In the case of such a 
computation 7 we have 


trace(t,7) = 1"*1(23)"2, for some n > 1. 


Finally, we can also have the trivial computation where ¢ enters only mem- 
brane 2, without any copy of d present in the system, and this leads to 
trace(t,7) = 12. 

Consequently, 


L(11) = {a"+1b™c™ | n >1,m <n} U {a"*1b" | n > 1}U {a}. 


Clearly, this language is not a context-free one. Note that system II only has 
symport rules, and that the rules are freely applied (we use no promoter or 
inhibitor). 


4 Universality results - using P systems with 
promoted/inhibited rules 
4.1 Computational universality —- the generative case 


We present two universality results concerning P systems with promoters or 
inhibitors at the level of rules. Both proofs are based on the simulations of 
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regularly controlled context-free grammars with appearance checking. Only 
the sketch of the first proof is given here. The detailed proofs can be found 
in [19]. 

As we presented in Subsection 2.1, the family of languages generated by 
such grammars L(ArCac) is equal to the family of all recursively enumerable 
languages £(RE). As a particular case, by NRE we denote the family of 
Turing computable sets of numbers. This family is isomorphic to the family 
of length sets of languages generated by regularly controlled context-free 
grammars with appearance checking over one letter terminal alphabet. 

Recall also, that, because P systems with symbol-objects operate with 
multisets of objects (therefore we do not have the order given by strings) 
at most we can study the equivalence with a family of vectors of natural 
numbers. 

We denote by PsP,,(cat, proR), the family of sets of vectors of natural 
numbers computed by systems with at most m membranes, 1 catalyst (say 
c) and objects as promoters. 


Theorem 1. PsP,,(cat,, proR) = PsRE. 


Proof. We will consider for this proof the implication NRE C PsP(cat, proR); 
the other way around is a straightforward construction based on the Church- 
Turing thesis. 

Let Greg = (Nreg, Treg, Preg, Steg) be a regular grammar generating the 
regular set Lyeg. We denote by r the number of rules in Peg. The rules of 
Preg are enumerated as i: (M; — p;Q;) ori: (M; — pj) withl <i<r, 
where M; € Nreg and pj © Tree V1 <i <r. For any such grammar Greg we 
can construct an equivalent right-linear grammar G’ = (N’,T’, P’, S’) in the 
following way: 


N! = Nreg U {MGi1, Mii), Miia) | 1 Si <7}. 


For any rule i: (M; > pjQ;) © Preg ori: (Mj — pi) © Preg, 1 <i < r we 
will have in P’ the sequence of rules: 


Mi > Mii), Miia) @ Me2), Miz) & Mii3y, Miz) > PiQir 
Mi > M1), Miia) 7 MG2), Mo.2) 2 Miz), Mais) > Pi 


respectively. Moreover, P’ does not contain other rules excepting the rules 
considered above. 
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In other words, the only difference between the two grammars is that 
a new terminal is produced in grammar G’ after every fourth step of a 
derivation. 

Now let us construct a P system which simulates the derivation process 
of a regularly controlled grammar with appearance checking. The system 
will use only two membranes, one catalyst and promoters. The innermost 
membrane will contain the generative mechanism and the results of com- 
putation will be sent out to the skin membrane which will be the output 
membrane of the system (the reason is that the catalyst is used during the 
computation to inhibit the parallelism and it cannot be removed, so we can- 
not obtain the number 0 as the result of computation if we use only one 
membrane). In what follows we will discuss only the rules in the innermost 
membrane since the skin membrane does not execute any task (its role is 
only to collect the objects obtained during computation). 

The promoters will be generated by a mechanism like the one presented 
above (promoters will be actually terminal symbols from T’ and, therefore, 
they will be generated every fourth step). They will permit the execution 
of “context-free” rules in the “right" order — the order given by the regular 
mechanism. 

In order to correctly simulate the appearance checking mechanism we 
have to modify the rules in grammar G’ such that we replace each rule of 
type Mi3) — piQ; by rules of type: M(3) — piQif or Mii3) — piQia 
depending on how the object p; indicates a rule from F (in the regularly 
controlled grammar definition, the set F C P represents the appearance 
checking set of rules; we will use the object a to identify that a rule with 
the corresponding label p; is in the appearance checking set; if not, we will 
produce object f in the rule). We will also consider the same construction 
for the rules in G’ of type Mii3) > Pir i.e., Mi3) => pif or Mi3) — pia. 
This means that, in the definition of our P system, for the inner membrane, 
we will have rules of the following types: 

e Mi > Mia), Mia) > Mi), Maz) + Mis), Maz) > piQif if pi is 
not a label in the appearance checking set; 

¢ Mi > Mia), Miia) & Mg), Mi2) + Moa), Maz) > piQia if pi is a 
label in the appearance checking set; 

eM; -> Mi) ; Mii) => Miz) 7 Miz) =~ M3) 7 M3) =F pif if Pi is nota 
label in the appearance checking set; 

eM; > Mi) 7 Mit) =p M(2) ; Mi) =f M(3) ; Mi3) — pia if Pi isa 
label in the appearance checking set. 
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For a context-free rule (p : (A — «)) © Gor (Gcp = (Nee, Tce, Pce, Sce)) 
we will have the following sequence of rules in our P system: 
cA + cD&|p, 
pp, 

P fe Alp, 
DA. 

Here, without losing generality. We have considered that a € (NU Tout)* 
which means that if we apply the rule p : (A — «) we will send the termi- 
nal symbols to the output region (recall that we are interested only in the 
number of objects). 

This sequence of rules stands for the case when the context-free rule 
A — a can be applied and, therefore must be applied. In the case when the 
rule mentioned cannot be applied we have to decide if the promoter present 
indicates a rule with the label in the appearance checking set or not. 

First, let us consider the case when the promoter is not a label in the 
appearance checking set. In this case, recall that we deal with the following 
sequences of productions (from the regular mechanism): 
eM; > Mia)» Miia) 7 Miz), Maz) 2 MG3), M3) > piQif, — or 
¢ Mi > Miia), Muay > Maz), Mi2) 7 Miia) » Miz) > pif- 

As a result of applying these rules we will have in the inner region, 
among others, the objects p and f. Let us consider that, for this case, we 
have the rules: 


f > fia 
fi > fr, 
pi #ly, 
froa, 
#o#, 

# — #. 

The first two rules from this group are meant to delay the execution of 
the third rule because we are not “sure” if the rule cA — cDap is or is not 
applied. 

With a construction similar to the one above we can solve the case when 
we deal with rules that have labels in the appearance checking set. This 
means that the rules to be applied are of the types: 

e M; => Mii) ; Mat) 4 Miz) 7 M2) => M3) 7 M3) = piQia j or 
© Mi > Miia), Maa) + Mii), Mii2) + Mci3), Miia) > pia. 
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Here the difference from the previous case is that the trap symbol # is not 
generated if the rule cA — cDa|, cannot be applied. We only have to delete 
the promoter p’. It may interfere in the next steps of computation if it is 
not deleted. The rules below state the fact that if a rule is in the appearance 
checking set and it can not be applied even if it is indicated by the regular 
mechanism, then it can be skipped. 


aay, 
a, — az, 
P ee, May, 
ay — Xx. 

Finally, the initial configuration of the P system is composed of the start- 
ing symbol Sreg of the regulating mechanism, the starting symbol Scr of 
the context-free mechanism and the catalyst c. The system will evolve in a 
maximally parallel manner and its behavior is controlled by the catalyst and 
promoters. 

It can be seen that the descriptional complexity in terms of number of 
membranes, number of catalysts and promoters is the same as in the original 
proof in [12], but simulating a different computational universal mechanism. 


As a particular case we have the following result: 
Corollary 4.1 NOP(cat,, proR) = NRE. 


As can be seen, the promoters combined with one catalyst are sufficient 
to prove the computational universal capabilities of the P systems when 
using only context-free object rewriting rules. Also, a similar result, but for 
inhibitors not promoters, stands. 


Theorem 2. PsP2(cat,,inhR) = PsRE. 


As a particular case we have the following result: 


Corollary 4.2. NOP»2(cat,,inhR) = NRE. 


4.2 Computational universality — the accepting case 


The following theorems illustrate the computational universality (in their 
accepting variants) of P systems with object rewriting non-cooperative rules 
and promoters /inhibitors at the level of rules. The systems we propose sim- 
ulate the moves of deterministic register machines. Moreover, the obtained 
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P systems are also deterministic. As in the previous subsection, we give here 
only the proof sketch of the first theorem, for the details suggesting [19]. 


Theorem 3. PsIP2(cat1, proR) = PsRE. 


Proof. In order to prove this assertion we will simulate a n—register machine 
M = (n,P,i,h). At each time during the computation, the current contents 
of register j is represented by the multiplicity of the object aj. 

Formally, we define the P system 


im = (V, C, (1 [2 lo ]1, 1 = O,W2, Ry = ©, Ro,1), 


where: 
V = {aj,Aj,$;|1<j<n}U{F,T}U {e,e' | (e: add(j), f) € PHU 
{e,e’,e” | (e: sub(j), f,z) € P}, 
C= te}, 
W2 = {c,e,a//,1 <j<n,k;€N}, 
and R> is defined as follows: 


e for each instruction (e : add(j), f) € P, we add to Ro the rules: 
ee A 
Cc caj| a, 
e’ > f 
e for each instruction (e : sub(j), f,z) € P, we add to Ro the rules: 
e—e'TS; 
caj > cF|s; 
Sj aA 
e’ 2% el! 
To T' 
e”’ = fle 
FA 
Tv! aN qT" 
el! > Z\r" 
T” >A 
e for the instruction (h : halt) € P, we add to R> the rules: 
aj >Hi, 1 Sj<n 
h—+>aA 
e the rule # > #is added to Ro, 
e no other rules are added to Ro. 
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The system works as following. Initially the P system starts the com- 
putation whit the objects att, ee ak, the catalyst c and the label e of the 
first instruction of the register machine we want to simulate in its input re- 
gion (region 2). The vector (k;,--- ,kn) represents the vector that has to be 
accepted by our P system. 


Theorem 4. PsIP2(caty,inhR) = PsRE. 


5 Universality results - using P systems with traces and 
symport/antiport rules 


The initial results presented in [20] (with which universality could be ob- 
tained using antiport rules of an arbitrary weight, as well as promoters or 
inhibitors), were significantly improved in [21]. 

Before presenting the latter we should mention that by /RE we mean 
the family of recursively enumerable languages over alphabets of size I, 
and that /LP;,(sym;,anti,) denotes the family of languages over alphabets 
of size 1 defined by traces of P systems with symport/antiport whit at most 
m membranes, symport of weight at most j, and antiport of weight at most 
k. 

Here are the results presented in [21]: 


ILP)44(symg, antiz) = 1RE 


ILP).1(syms3, antig) = 1RE 
ILPi49(symy,antig) = 1RE 


For details regarding the proof, the reader is asked to consult the above 
mentioned paper. 

It can be seen that the number of membranes used to obtain universality 
is strictly dependent on the size of the chosen alphabet, so an infinite hier- 
archy is created. The question we try to answer in the next subsection (more 
precisely we suggest some answers to it), is: Can /RE be obtained with P 
systems in which the number of membranes does not depend on /? 
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5.1 Ways of obtaining |RE with the number of membranes not 
depending on / 


Several travelers 


The firs way of decreasing the number of membranes is to consider several 
travelers. Let us suppose that alphabet T (which was mentioned in section 
2.3) has k components (travelers) and the considered P system contains m 
membranes. 

In this case V consists of k * m symbols Mi, jr where 1 <i <k,and1 < 
j <m. Let 7 = CyC2...Cy, k > 1, be a halting computation and let C;(T) = 
{a;;| t; be in membrane j}. 

We consider trace(T,7) = {w ,W2,...,Wx|w; € V*, ¥y(w;) = ¥v(C,(T)) $, 
i.e. w; is a linear arrangement of C;(T) (more precisely, we allow here any 
permutation). Hence, the language defined by a P system with several trav- 
elers is given by: LT(II) = {h(trace(T,c))| o is a halting computation in 
TT}. 

We should also mention that the language LT(IT) can be over k * m sym- 
bols (provided by the k travelers and m membranes considered). 


Using an inverse morphism 


The second idea (invitation) we propose is to consider an inverse morphism. 
Let us suppose we have the following morphism h : V* — U*. Then, the 
inverse morphism of h is defined as h~! : U* — 2”", where h7!(y) = {x € 
V*| a=}, ye UO, 

Now let us consider the following example: let us have two alphabets, 
V = {a4,a2,...,a,}, U = {0,1} and the mapping h defined as above, 
where h(a;) = 0'1. It is obvious that this mapping is injective; hence, 
card(h~!(y)) = 1. We choose now a recursively enumerable language 
L C V* and we write it in the form L = h~!(h(L)). It it obvious (from 
the way we defined h) that choosing L from kRE, h(L) € 2RE. 

Using now the results from [21] we conclude that: 


h(L) € 2LP3(symo, anti), 


and 
h(L) € 2LP3(sym3, antig). 
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Changing the labels of membranes 


Our final proposed answer to the core question of this section is to consider 
the change of the labels of the membranes. The idea is “borrowed" from the 
P systems formalisms which deal with active membranes (details in [17]). 

Our (symport) rules of the form (x,in), (x,out) can be rewritten as 
x[]; > [x];, and [x]; — [ ];x, respectively, while the rule (x, out; y,in) (the 
case of antiport) can be rewritten as y|x]; > [y];x. Generalizing, we can con- 
sider that whenever an object enters or exits a membrane it can change its 
label. Thus, our rules will become: x[]; — [x]j, [x]; > []jx, and y[x]; > [y]jx, 
respectively. 

Once this point has been reached another problem might arise: namely, 
the conflict with other labels. To be more precise, when label i changes to j, 
a different membrane may have the latter, which is of course, quite unwel- 
come. For this reason, we thought of three ways of avoiding this conflict. 
The first is the sequential use of rules. The second solution is to considera co- 
herent set of rules of the form (i,j), all passing from i to j. And the last one is 
to use at most one rule which changes the label and, in parallel, other rules 
which do not change the label of the membranes. The reader is encouraged 
to try them. 


VV 


6 Final remarks and future work 


We have put together some powerful results which arise from the mecha- 
nisms of P systems with promoters/inhibitors, and symport/antiport con- 
sidering the trace of a certain object through membranes. 

We have also recalled here a new way of defining the result of a compu- 
tation in a P system with symport and antiport rules: namely, taking into 
consideration the trace of a specified object through membranes during a 
computation. This idea is attractive because it leads to a string associated 
with a computation rather than a number. We believe this last idea is worth 
considering also for other classes of P systems. Of particular interest are sys- 
tems whose computation is based entirely on communication, such as those 
involving carriers [5] and various classes of systems with symport/antiport 
rules. For instance, we can consider P systems with symport/antiport rules 
with other control mechanisms on the use of rules or with other arrange- 
ments of membranes — (e.g., P systems working on networks of elementary 
membranes [8]). 
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In the last section we proposed three different ways of solving the infinite 
hierarchy problem related to the direct relationship between the necessary 
number of the membranes in a P system needed to generate a RE language. 

We also want to propose here that the relationship of P systems with 
traces and Gauss codes be studied. (Gauss codes are one of the oldest prob- 
lems leading to a formal language). Here is how they are obtained: consider 
a planar closed curve with simple crossing points (i.e. it is not a tangent 
point and the curve crosses itself only once that point). Assign the numbers 
1,2,...,n to the n crossing points of a given curve c. A sequence x(c) con- 
taining exactly two occurrences of each i (i between 1 and n) and describing 
the passing of the curve through the crossing points is called a Gauss code. 
More details about Gauss codes can be found in [22].) It seems natural, in a 
way, to think that the traveler considered here will “jump" on a Gauss curve 
and follow its path, but so far no work has been done in this direction. 


References 


1. B. Alberts et al., Essential Cell Biology. An Introduction to the Molecular Biology of 
the Cell, Garland Publ. Inc., New York, London, 1998. 

2. LI. Ardelean, The relevance of cell membranes for P systems. General aspects, 
Fundamenta Informaticae, 49, 1-3 (2002). 

3. J. Dassow, Gh. Paun, Regulated Rewriting in Formal Language Theory, Springer- 
Verlag, Berlin, 1989. 

4. C. Martin-Vide, A. Paun, Gh. Paun, G. Rozenberg, Membrane systems with cou- 
pled transport: Universality and normal forms, submitted, 2002. 

5. C. Martin-Vide, Gh. Paun, G. Rozenberg, Membrane systems with carriers, The- 
oretical Computer Sci., 270 (2002), 779-796. 

6. A. Paun, Gh. Pdun, The power of communication: P systems with sym- 
port/antiport, New Generation Computers, to appear. 

7. A. Paun, Gh. Paun, A. Rodriguez-Paton, Further remarks on P systems with 
symport rules, Ann. Univ. Al.I. Cuza, Iasi, to appear. 

8. A. Paun, Gh. Paun, G. Rozenberg, Computing by communication in networks of 
membranes, submitted, 2001. 

9. Gh. Paun, Computing with membranes, Journal of Computer and System Sciences, 
61, 1 (2000), 108-143, and Turku Center for Computer Science-TUCS Report No 208, 
1998 (www.tucs.fi). 

10. G. Rozenberg, A. Salomaa, eds., Handbook of Formal Languages, 3 volumes, 
Springer-Verlag, Berlin, 1997. 
11. A. Salomaa, Formal Languages, Academic Press, New York, 1973. 


; TRIANGLE 6 e December 2011 


88 


12. 


13. 


14. 


15. 


16. 


17. 


18. 


19. 


20. 


21. 


22, 


M. Ionescu 


P. Bottoni, C. Martin-Vide, Gh. Péun, G. Rozenberg, Membrane Systems with 
Promoters/ Inhibitors Acta Informatica, 38, 10 (2002), 695-720. 

R. Ceterchi, D. Sburlan, Simulating Boolean Circuits with P Systems, Workshop 
on Membrane Computing WMC-Tarragona 2003, (A. Alhazov, C. Martin-Vide, G. 
Paun, eds), TR 28/03, URV Tarragona, 2003. 

R. Freund, L. Kari, M. Oswald, P. Sosik, Computationally Universal P Systems 
without Priorities: Two Catalysts Are Sufficient; submitted 2003. 

S. Khrisna, A. Paun, Three Universality Results on P Systems, Workshop on Mem- 
brane Computing WMC-Tarragona 2003, (A. Alhazov, C. Martin-Vide, G. Paun, 
eds), TR 28/03, URV Tarragona, 2003, 198-206. 

MLL. Minsky, Finite and Infinite Machines, Prentice Hall, Englewood Cliffs, 1967. 
Gh. Paun, Membrane Computing. An Introduction, Springer-Verlag, Berlin, 2002. 
Gh. Paun, G. Rozenberg, A Guide to Membrane Computing, Theoretical Computer 
Science, 287, 1 (2002), 73-100. 

M. Ionescu, D. Sburlan, On P Systems with Symport/Antiport, Journal of Univer- 
sal Computer Science, 10(5), 2004, 581-600. 

M. Ionescu, C. Martin-Vide, G. Paun, P Systems with Symport/Antiport: The 
Trace of Objects, Grammars, 5(2), 2002, 65-70. 

P. Frisco, H.J. Hoogeboom, Simulating Counter Automata by P Systems with 
Symport/Antiport, Lecture Notes in Computer Science, 2597, 2002, 288-301. 

C. Martin-Vide, V. Mitrana, Gh. Paun (eds.), Formal Languages and Applications, 
Springer-Verlag, 2004, 20-21. 


; TRIANGLE 6 e December 2011 


Aggregation with Recombination Patterns 


M. Dolores Jiménez-Lépez 


Research Group on Mathematical Linguistics, 
Rovira i Vigili University 

Tarragona, Spain 

E-mail: mariadolores. jimenez@urv.cat 


Summary. In this paper, we show the commonalities between aggregation processes 
in Natural Language Generation and recombination patterns, a framework introduced 
recently as a way of generating complex sentences in natural languages using very 
simple recombination —and therefore biological— rules. By showing similarities be- 
tween these two mechanisms, we suggest the possibility of carrying out aggregation 
by means of recombination patterns. We also refer to the possibility of using such 
a biological-motivated framework in the design of efficient and simple natural lan- 
guage generation devices. 


1 Introduction 


Natural Language Generation (NLG) may be seen as the technique of let- 
ting a computer automatically create natural language out of a computa- 
tional representation. In order to generate natural language from computa- 
tional representations, different processes must be carried out. First, from 
the knowledge base, it must be decided what to say: this is the so-called con- 
tent selection phase. Then, during the so-called text plan, the order in which 
the sentences should be generated to make the text coherent is decided. 
Then sentence planning is carried out. And finally, in the so-called surface 
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generation, the syntactic structures and lexical choices (how to say it) are per- 
formed. Part of the sentence planning is the task of aggregation. 


Aggregation has become a popular research topic in the field of NLG 
[8, 10]. It is usually described as a process in generation that consists of 
removing redundant information without losing information. Research on 
this topic has proved to be important in NLG because without aggregation 
automatically generated text is often very poor. In contrast, aggregation im- 
proves the quality of generated text by making it more fluent, concise and 
easy to read. Vastly different approaches to aggregation can be found in the 
literature and they are often not compatible. The definition of aggregation 
provided in [3], and reported below, is considered to be one of the best since 
it summarizes what other researchers consider this task to be: 


‘Functioning as one or a set of processes acting on some inter- 
mediate text structures in text planning, aggregation decides which 
pieces of structures can be combined together to be realized as com- 
plex sentences later on so that a concise and cohesive text can be 
generated while the meaning of the text is kept almost the same as 
that without aggregation.’ 


In this paper we want to show the commonalities between that important 
area in NLG and a formalism that have recently been defined in [9] with the 
aim of showing that the mechanism of recombination —found in Biology— 
also works in natural language. This new framework -called Recombination 
Patterns— shows that it is possible to generate a natural language (or an im- 
portant part of it) by starting with a base -composed of a finite number of 
simple sentences and words- and by applying a small number of recombi- 
nation rules. Initially, the formalism is defined for the generation of complex 
sentences in English, Italian and Spanish. 


It is accepted that people do aggregation all the time to make natural lan- 
guage expressions shorter, non-redundant and easy to read. On the other 
hand, it seems that recombination patterns can explain in a very natural fash- 
ion the way in which people process natural language. So, both aggrega- 
tion and recombination patterns could be found in the human processing of 
natural language. Now, since software engineering tools, data bases or ex- 
pert systems are often highly redundant, aggregation processes have been 
defined and implemented for NLG devices. What we claim here is that re- 
combination patterns also — such as aggregation- can also be applied to the 
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field of NLG and that such a new formalism may facilitate the definition of 
simple and efficient natural language generation devices. 


The great deal of similarities between aggregation processes that have 
been defined in the area of NLG and recombination patterns lead us to 
suggest here that it may be possible to carry out aggregation by means of 
recombination patterns. Moreover, as we have already said, we suggest that 
the naturalness of recombination patterns may enable them to be used in 
the area of NLG. 


The paper is divided into six sections. Section 2 and 3 briefly introduce 
aggregation and recombination patterns, respectively. Section 4 compares 
two issues. Section 5 suggests that it may be possible to used recombination 
patterns in natural language generation. Finally, some final remarks and 
basic references are presented. 


2 An overview of aggregation 


Researchers in the field have different understandings of what aggregation 
is. According to [11], aggregation is taken to be 1) redundancy elimination, 
2) abbreviation, 3) text structure combination for concise and coherent text 
generation, 4) the combination of text components at any level to achieve a 
cohesive text, 5) the syntactic expression of concise and tightly constructed 
text and the generation of fluent, more readable and less boring text, etc. 
The scope of this section is, as far as possible, to shed light on this topic by 
providing answers to the questions: what, why, when and where aggrega- 
tion. At the end of the section, the reader should have a clear idea of what 
aggregation is, and will be able to understand the comparison we will make 
with recombination patterns. 


What is aggregation? Aggregation is the process of removing redundant in- 
formation in a text without losing any information. In [11], two usages 
of the term ‘aggregation’ are pointed out: 


1. the narrow sense: ‘Aggregation is any process which maps one or 
more structures into another structure which gives rise to text 
which is more x-aggregated than would otherwise be the case. X- 
aggregated text is text which contains no multiple nonpronominal 
overt realizations of any propositional content.’ 
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2. the broad sense: ‘Aggregation is the combination of two or more lin- 
guistic structures into a single linguistic structure which contributes 
to sentence structuring and construction.’ 


According to [16], aggregation is not a process, a task, or a set of de- 
cisions in generation, but a reason for decisions to be made in a certain 
way. Aggregation is not an end in itself. Aggregation is an emergent 
phenomenon. From this perspective, aggregation is neither a process 
nor a goal of processes; rather, it is a characteristic of texts which have 
been generated properly. 


Why is aggregation done? There are as many reasons as there are goals. 


People and systems must perform aggregation to make their text more 
readable, understandable, fluid, concise, coherent, cohesive...: not do- 
ing so risks the reader’s misunderstanding or irritation. We assume ax- 
iomatically that shorter and less redundant text is better text. 


When is aggregation done? Assuming that in NLG the tasks carried out are 


content selection, sentence planning and surface generation, we may say that 
aggregation takes place after content selection and before surface genera- 
tion. So, aggregation is part of the sentence planning. Anyway it can be 
carried out whenever the appropriate structures arise, it can take place 
during every phase of NLG except during content selection and surface 
form generation. This leads to the view that there need not be a specific 
aggregation module in a generation pipeline. Rather aggregation can be 
done whenever possible on an opportunistic basis. 


Where is aggregation done? Wherever the appropriate structures arise. In 


fact, there seems to be no aspect of language generation which can be 
excluded from a thorough consideration of aggregation. 


What is aggregation done to/on? In order to answer this question, we can 


refer to the several types of aggregation that have been described. The 
typology suggests that it can be done at any level of linguistic repre- 
sentation. Types of aggregation differ from author to author. [10] distin- 
guishes: 1) Embedding or hypotactic aggregation; 2) paratactic aggregation; 
and 3) lexical aggregation. A similar classification can be found in [3] 
where the types of aggregation are: 1) Lexical aggregation; 2) embedding; 
3) hypotactic aggregation, and 4) paratactic aggregation. Now, combining 
the information from [11] and [5] we offer the following classification of 
general types of aggregation: 
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1. Conceptual aggregation. Conceptual aggregation typically reduces the 
number of propositions in the message while increasing the com- 
plexity of the value of some conceptual role. 

2. Discourse aggregation. This reduces the complexity of rhetorical struc- 
ture but increases the complexity of one of the propositional leaves. 

3. Semantic aggregation. We take it that conceptual information is non- 
linguistic, language-independent, domain knowledge and that se- 
mantic information is linguistic, language-dependent representa- 
tions of meaning. An example of semantic aggregation might be 
the mapping of the meanings of “Jamie is Chris’s sister’ and ‘Chris 
is Jamie’s brother’ to the meaning of ‘Chris and Jamie are brother and 
sister’. 

4. Syntactic aggregation. This removes redundant information, but leaves 
(at least) one item in the text to carry the meaning explicitly. This is 
carried out at a pure syntactic level with no information loss about 
the content of the aggregation items. It is the most common form of 
aggregation. The two most common rules are (a) the subject group- 
ing rule and (b) the predicate grouping rule. 

5. Elision. This removes information that can be inferred and leaves no 
items in the text to carry the information explicitly but the informa- 
tion remains there implicitly. 

6. Lexical aggregation. This replaces a set of items with a new item, while 
the overall meaning is kept intact. 

7. Referential aggregation. This replaces redundant information with 
some sort of trace, such as a pronoun, to carry the information ex- 
plicitly. 


The systems which implement aggregation are designed in a number of 
different ways. Their architectures can be classified under the following four 
categories, on the basis of where aggregation is performed: 


1. Independent sentence-planning module. This treats aggregation as a mi- 
croplanning subtask which happens after text planning and before sur- 
face realization. When it runs, the full contents have been selected and 
the rhetorical structure of the text is determined. The module attempts 
to improve the text quality by performing aggregation on propositions 
which are either adjacent or close enough to be brought together, but 
the interaction with other NLG tasks is minimal. 
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2. Opportunistic text planning. This views aggregation as a tool used by text 
planning or content selection. It is used to generate concise summaries 
of some input data. The first planning task is to determine what in- 
formation is essential, and what information is optional. All essential 
information is included in the text plan, but then aggregation, espe- 
cially embedding, is used to determine which optional facts should be 
included. If an optional fact can be expressed concisely, it is included; 
otherwise it is left out. 

3. Discourse Organization Module. This approach combines ideas from the 
first two approaches. It assumes that the content is selected as a first 
independent step and that all facts passed on must be included, but 
then it uses the technique of opportunistic planning to build the text 
structure, yielding a concise and flowing text. It is one of the best ways to 
perform aggregation when content determination can be done up-front 
without regard to realization issues. This approach gives the application 
a high degree of flexibility. 

4. Revision-based generation. This does not contrast with the former types; 
rather it is a particular feature of some systems in the other categories. 
It is based on human writing analysis which concludes that writing 
is usually done in three heavily interwoven (and recursively nested) 
phrases: planning, sentence generation, revision. Since humans generate 
multiple drafts before producing a final text, it is reasonable for NLG 
systems to do the same. 


3 Introducing recombination patterns 


In [9], we define recombination patterns for natural language syntax with the 
aim of showing that the mechanism by which genetic material is merged 
-that is, recombination— works and is also valid in natural languages. The 
main scope of the idea we propose is not so much to explain the human 
processing of language as to be able to offer a formalism that may simplify 
generation mechanisms for natural language; that is to define a formalism 
that may allow the construction of natural language processing systems that 
are as simple and efficient as possible. We are somehow convinced that re- 
combination may well explain the way in which speakers combine linguis- 
tic elements (words, phrases, sentences) that they already know in order to 
construct new linguistic structures. More specifically, our research focus on 
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the possibility of generating a natural language (or an important part of it) 
using a finite number of words, phrases, sentences... and by only applying 
recombination rules. In order to show that recombination may well describe 
natural syntax we have focused, up to now, on the study of complex sen- 
tences in three languages: English, Italian and Spanish. We have defined dif- 
ferent formulae -called patterns— for coordinate and subordinate sentences 
in these three natural languages. Up to now, we have defined 379 patterns 
that allow us to generate complex sentences in English, Italian and Span- 
ish by the recombination of simple sentences. Each pattern is a very simple 
formalism divided into three different parts: 


1. The first one -called pattern recognition— defines in terms of some basic 
categories —proposition, substantive, noun, etc.— the two propositions 
we always have at hand in order to generate complex sentences. 

2. The second part -called pattern recombination— is where the application 
of different recombination rules (insertion, deletion, transposition, trans- 
formation, etc.) takes place. Taking into account that in every pattern we 
start by having at hand two propositions, this phase starts by placing 
Proposition 2 in some place with reference to Proposition 1. It does so 
by performing either 
e an initial insertion. In this type of insertion, Proposition 2 is situ- 

ated before Proposition 1. The typical instruction here says: Insert 
Prop.2 before Prop.1; 

e a median insertion. Here, Proposition 2 is placed after the Subject of 
Proposition 1, the instruction being something like Insert Prop.2 
after Subject in Prop.1;or 

e a final insertion. Final insertion situates Proposition 2 after Proposi- 
tion 1. So, the instruction here says: Insert Prop.2 after Prop.1. 

After the positioning of Proposition 2 with respect to Proposition 1, 

some recombination rules are applied. 

3. And finally, in the last part -called grammatical adaptation— necessary 
changes (insertion of elements, transformation of verbal tenses and 
modes...) in order to obtain grammatical sentences are made. 


Recombination rules applied during the pattern recombination and the 
grammatical adaptation phases are only of the following five types: 
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1. Insertion, where some element or structure is inserted at some point of 
the string. 

2. Deletion, where some elements of the structure are deleted. 

3. Transposition, where some elements are shifted to another location in the 
string. 

4. Inversion, where elements in the string can be reversed. 

5. Transformation, where some elements change their form. 


One of our goals in defining such formulae —-even though not the most 
important one- is to do without the traditional grammatical division of com- 
plex sentences and to speak just of patterns that recombine some acceptable 
structures and give some other acceptable structures. Patterns defined, then, 
give up the usual terminology (relative sentences, noun sentences and the 
like) and are organized according the following criteria: 


e Variables used. We define patterns by specifying the variables used. We 
specify the variables at hand when we start the application of the rules 
sequence (in) and the result obtained after the application of such a se- 
quence (out). 

e ‘Type of insertion. Taking into account that in every pattern we start by 
having at hand two propositions, by type of insertion we mean the place 
-the beginning, middle, end— where the second proposition is situated 
with respect to the first one. 

e Rules sequence. Sequence of rules applied in order to obtain the complex 
sentence from the two simple sentences considered. 

e Element introduced. The word we introduce in order to generate a com- 
plex sentence from the recombination of two simple ones. 


The following example shows what patterns look like. In this example, 
the four criteria above can be observed. Notice, also, that the three different 
parts mentioned above are distinguished: (1) pattern recognition (roman); 
(2) pattern recombination (italics); and (3) grammatical adaptation (boldface). 
Moreover, we consider that each of the simple sentences in the so-called 
pattern recognition phase belongs to a Corpus (C). 


By using the pattern in Table 1, sentences like “The book which he read 
belonged to John’, ‘The umbrella which she took is mine’, ‘The report which Karen 
submitted implicated several of her friends’ or “The books which he had recom- 
mended were unobtainable’ can be generated. In order to show how patterns 
work, we show how to generate the first sentence. The reader can easily 
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Pattern: (in: x, y, w, Z, 8; out: x WHICH w s y) 
Rules Sequence: Median Insertion, Transposition, Deletion, Insertion 


English: WHICH 


xy = Prop. 1: Proposition € C 
wz = Prop. 2: Proposition € C 
x: Substantive € C 

w: Substantive € C 

y: Predicate € C 

Z=s x: Predicate € C 

s: Verb € C 


x (ws x) y: Quasi Proposition Insert Prop.2 after x in Prop.1 
x(x ws) y: Quasi Proposition Transpose X in Prop.2 to initial position in Prop.2 


x WHICH w s y: Proposition Delete x in Prop.2 & Insert WHICH 


Table 1. Example of a Recombination Pattern. 


check how to obtain the other three —and sentences of the same type— by 
using the same pattern. As shown in the first part of the pattern in Table 1, 
we start by identifying two simple sentences and by defining them in terms 
of basic categories as shown below: 


The book belonged to John:= x y 
He read the book:= w z; z =s x 


— The book:= x 

— belonged to John:= y 
—- He:=w 

— read the book:= z 

— read:=s 


Now, once we know the elements we have at hand, we start the phase of 
pattern recombination. In our example, we perform the two recombina- 
tion rules —-Insert Prop.2 after x in Prop.1; Transpose x in Prop.2 
to initial position in Prop.2~ indicated in the pattern above: 


The book (he read the book) belonged to John:= x (ws x) y 
The book (the book he read) belonged to John:= x (x ws) y 
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And finally, in the grammatical adaptation phase we introduce the elements 
required in order to obtain a grammatical sentence. By performing the rule 
indicated in the pattern—Delete x in Prop.2 & Insert WHICH- we obtain 
the following structure: 


The book WHICH he read belonged to John:= x which ws y 


The patterns we have defined show how to combine and modify sim- 
ple sentences in order to generate complex ones. What is important in our 
formalism is that we can generate complex structures by using simple sen- 
tences and by applying only five types of very simple rules: insertion, inver- 
sion, deletion, transposition and transformation. This simplifies the generation 
of natural languages quite considerably. From the patterns defined it may 
not be difficult to define an algorithm that specifies the steps required to 
generate complex sentences from simple ones. We also have the intuition 
that such an algorithm may be language independent, since patterns de- 
fined for the three languages considered are very similar -they only have 
significant differences in the so-called grammatical adaptation phase. The 
definition of an algorithm of these characteristics may have important im- 
plications in the fields of natural language generation and language univer- 
sals, since it shows that formal methods can be defined to generate natural 
languages independently of specific languages. Such a formalism may also 
have interesting implications in the areas of cognitive science or machine 
learning theory since —as pointed out by [1]- intuitively, maybe recombina- 
tion rules are more feasible than rewriting systems to explain how humans 
process language. 


4 Aggregation versus recombination patterns 


The task of aggregation is to combine simple representations to form a com- 
plex one. Recombination patterns fit with this broad sense of aggregation 
understood as the ‘combination’ of any two linguistic structures to produce 
a third more ‘complex’ structure. 


According to [11], regardless of the level of representation, aggrega- 
tion is performed on linguistic structures (unless they are predominantly 
conceptual). Thus, theories of coordination and subordination are relevant. 
However, there is almost no mention of linguistic theory in the aggrega- 
tion literature. There are several theories of coordination and subordination 
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in the linguistics literature which could be used, but this appears not to 
have been done. Instead, relatively simple and superficial approaches, es- 
pecially in syntactic aggregation, have been adopted. Indeed, compared to 
the linguistic theories available, the treatments of coordination in particular 
implemented NLG systems have been relatively trivial. The conclusion we 
draw from this is that any successful system which achieves ‘aggregated 
text’ will have to incorporate linguistic knowledge about coordination and 
extraction, ellipsis, focus, centering and discourse and lexical semantics. Re- 
combination patterns can offer a very simple linguistic theory of syntax that 
may contribute to the improvement of aggregation processes in NLG by 
incorporating a natural and implementable treatment of coordination and 
subordination. 


In [12], Casper (Clause Aggregation in Sentence PlannER) is introduced 
as a sentence planner which focuses on generating concise sentences —an 
expression is more concise than another expression if it conveys the same 
amount of information in fewer words. Complex sentences combine clauses 
and are more concise than the corresponding simple sentences because mul- 
tiple references to the recurring entities are removed. Clause aggregation 
can happen at three levels: 1) inferential, 2) rhetorical, and 3) linguistic. 
Here we are interested only in the linguistic level. At this level, lexical and 
syntactic information are used to combine clauses. We can distinguish the 
following two types of linguistic aggregation operators: 


1. Hypotactic operators. The term hypotaxis describes the relation between a 
dependent element and its dominant clause. To aggregate two proposi- 
tions using hypotactic operators, the propositions must share some en- 
tities in common. When they do, hypotactic operators try to transform 
one of the clauses into a modifier. Since the goal is to generate concise 
text, CASPER prefers to transform a proposition into an adjective if pos- 
sible, then a PP, a participle clause, and if all else fails, a relative clause. 

2. Paratactic operators. Paratactic aggregation operators combine clauses us- 
ing constructions of equal status, such as coordination. Two approaches 
to combining propositions using coordinate constructions can be distin- 
guished. In the first approach, adjacent propositions that have only one 
slot containing distinct elements are collapsed into one proposition with 
one conjoined slot containing the distinct elements. In the second ap- 
proach, the conjoined propositions have distinct elements in more than 
one slot. To combine them, each conjoined proposition is generated, but 
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deletion rules are used to ensure the resulting sentence has the correct 
ellipsis. 


If we consider the above two types of operators used in linguistic aggre- 
gation and we look at some examples of aggregation rules, we will realize 
the similarities between the technique used in Natural Language Genera- 
tion and recombination patterns. Those similarities could support our idea 
of applying our framework to the field of aggregation. Moreover, we claim 
that recombination patterns provide a simple and natural linguistic theory 
of coordination and subordination that can be easily used in those syntactic 
aggregation processes. 


We can distinguish between embedding and paratactic aggregation. Em- 
bedding is considered the most interesting and powerful type of aggrega- 
tion. It can be used to express multiple facts about an entity concisely, and 
it is an effective tool in opportunistic generation. Three types of embedding 
are given in [15]: nominal, adjectival, and adverbial. Each type has a number 
of possible realizations with different complexities: 


1. Nominal. A nominal can be embedded as a noun or an appositive phrase. 
For example, given ‘King made this jewel’; ‘King is a Scottish designer’, the 
second proposition can be embedded as a noun: ‘The Scottish designer 
King made this jewel’, or as a appositive phrase: ‘King, a Scottish designer, 
made this jewel’. 

2. Adjectival. An adjectival can be embedded as an adjective, a preposi- 
tional phrase or a relative clause. For example, given ‘A man bought the 
picture’; ‘The man had blond hair’, the second proposition can be embed- 
ded as an adjective: ‘A blond man bought the picture’, as a prepositional 
phrase: ‘A blond man with blond hair bought the picture’, or as a relative 
clause: ‘A man who had blond hair bought the picture’. 

3. Adverbial. An adverbial can be realized as an adverb or a prepositional 
phrase. For example, given ‘Paula danced with Peter’; ‘She was willing’, 
the second proposition can be embedded as an adverb: ‘Paula danced 
with Peter willingly’, or as a prepositional phrase: ‘Paula danced with Peter 
with willingness’. 


When many forms of embedding are possible for the same proposition, 
prefer the simplest one. 


In [9] we have defined recombination patterns that can account in a very 
simple way for the three types of embedding referred in [15]. However, in 
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order to show the similarities between aggregation processes and recombi- 
nation patterns we focus here on paratactic aggregation. Paratactic aggre- 
gation consists of conjoining two or more propositions with the help of a 
coordinating conjunction. Redundant information can be elided so that the 
conjunction can be found at any depth in the sentence structure. For exam- 
ple: ‘John has a car’; ‘John drives to school’ can be aggregated to ‘John has a car 
and drives to school’. At a deeper nesting level, ‘John walks with Mary’; ‘John 
walks with Jane’ can be aggregated to ‘John walks with Mary and Jane’. 


In what follows we will use the following abbreviations: S = Subject; P = 
Predicate; Do = Direct object; Pc = Predicative Subject Complement; Conn 
= Connectives, i.e. And, Or, Xor; Xor = exclusive or. Moreover, we assume 
that a normal English clause has the following order S P Do. Two examples 
of aggregation rules are shown in Tables 2 and 3. 


Definition of Predicate and Direct Object grouping (PDO-grouping): 


S;PDo Conn S2PDo Conn SyPDo —> S$; Conn Sp Conn... Sy»PDo 


Table 2. Example of an Aggregation Rule: PDO-grouping. 


Definition of Subject and Predicate grouping (SP-grouping): 


SP(Do, Xor Pc,) Conn SP(Do, Xor Pcz) Conn... SP(Doy, Xor Pen) —> SP(Do, Xor 
Pc) Conn (Doz Xor Pcy) Conn (Doy Xor Pen) 


Table 3. Example of an Aggregation Rule: SP-grouping. 


If we apply the aggregation rule in Table 2 to the following two sen- 
tences: 1) ‘John is a student’ and 2) ‘Mary is a student’, the aggregated sentence 
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obtained is: ‘John and Mary are students’, where the grouped phrase is: ‘are 
students’. 


Now, if we apply the aggregation rule in Table 3 to: ‘John is a boy’ and 
‘John is tall’, the result after aggregation with Subject and Predicate Group- 
ing is : ‘John is a boy and tall’, where the grouped phrase is: ‘John is’. 


If we compare the aggregation rules in Tables 2 and 3 with the recombi- 
nation patterns in Tables 4 and 5, the similarities are evident. 


PATTERN: AND (Median Insertion) 
x y: Proposition € C 
w y: Proposition € C 
x: Substantive € C 
w: Substantive € C 


x (w y) y: Quasi Proposition Insert Prop.2 after Subject in Prop.1 


x AND (w y) y: Quasi Proposition Insert AND before Prop.2 
x AND w y: Proposition Delete in Prop.2 elements present in Prop.1 


Table 4. Recombination Pattern: AND (Median Insertion). 


By using the recombination pattern in Table 4, we can obtain an aggregated 
sentence like ‘Gordon and Shirley missed the meeting’ by performing the steps 
established in the pattern as follows: 


e Gordon missed the meeting:= x y 
e Shirley missed the meeting:= w y 


Gordon:= x 
missed the meeting:= y 
Shirley:= w 


Gordon (Shirley missed the meeting) missed the meeting:= x (w y) y 


Gordon AND (Shirley missed the meeting) missed the meeting:= x 
and (w y) y 
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PATTERN: AND (Final Insertion) 
x y: Proposition € C 
x z: Proposition € C 
x: Substantive € C 


x y (x z): Quasi Proposition Insert Prop.2 after Prop.1 


x y AND (x z): Quasi Proposition Insert AND after Prop.1 
x y AND z: Proposition Delete in Prop.2 elements present in Prop.1 


Table 5. Recombination Pattern: AND (Final Insertion). 


Gordon AND Shirley missed the meeting:= x and w y 


Now if we apply the recombination pattern in Table 5, we can obtain 
paratactic aggregations such as the one in “Smith hit the ball and ran to first 
base’ by performing the rules stated in the pattern: 


e Smith hit the ball:= x y 

e Smith ran to first base:= x z 
Smith:= x 
hit the ball:= y 
ran to first base:= z 


Smith hit the ball (Smith ran to first base):= x y (x z) 


Smith hit the ball AND (Smith ran to first base):= x y and (x z) 
Smith hit the ball AND ran to first base:= x y and z 


What we have tried to show in this section is that recombination patterns 
and rules proposed in the field of aggregation are very similar. In fact, the 
same result can be obtained by applying an aggregation rule, such as the 
ones in Tables 2 and 3, or by applying a recombination pattern such as the 
ones defined in Tables 4 and 5. Therefore, what we claim here is that aggre- 
gation may be carried out by using recombination patterns, an integrative 
approach to natural syntax coming from formal languages, biology and lin- 
guistics. By offering a simple, natural and implementable new approach to 
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syntax, recombination patterns might provide the missing linguistic theory 
in most of the implemented aggregation systems. 


5 NLG with recombination patterns 


In this section we would like to suggest the possibility of applying recom- 
bination patterns to natural language generation. What we present here is 
just a preliminary intuition about this possibility, nothing else. A great deal 
of research should be done in this direction in order to actually define an 
NLG system based on the formal framework introduced in this paper. 

In general it is accepted that in a Natural Language Generation System 
the following phases can be distinguished: 


e Content selection: from the abundant knowledge base we select what to 
say. This phase packages information as verb-based, clause-sized propo- 
sitions, each of which is realized as a single sentence. Many of these 
propositions share common features, such as the entity being described. 
If a generation system simply generates each proposition as a sentence, 
the output will contain many repetitive and redundant references to 
common features. 

e ‘Text plan: decides the order in which the sentences should be generated 
to make the text coherent. 

e Sentence planning: its main task is aggregation —the combining of seman- 
tically related propositions in order to produce concise and fluent ex- 
pressions. The goal or the sentence planner is to transform a set of input 
propositions into a minimum number of words under lexical, grammat- 
ical and pragmatic constraints. This transformation process occurs in 
multiple stages. In each stage, a set of combining operators is applied to 
the propositions. 

e Surface generation: it has to be decided how to say it, i.e., the realization of 
the syntactic structures and lexical choices. 


Now let us take a slightly simplified view of the text generation process 
as a pipeline of three stages: 1) Text planning (which determines the content 
and overall discourse structure of the text material), followed by 2) sentence 
planning (which decides on the sentence structure and scope), which in turn 
is followed by 3) surface form realization (which is based on syntax). So, the 
steps in a natural language generation system using aggregation techniques 
can be the ones in Figure 1. 
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Content Selection/Text Plan 


Y 
Input Proposition 


Y 
Sentence Planner-Agegregation 


Y 
Aggregated Text 


Y 
Surface Generation 


Y 
Complex Sentence 


Fig. 1. NLG System with Aggregation. 


If we apply our framework to the scheme of natural language genera- 
tion steps in Figure 1, we obtain a completely equivalent NLG system that 
consists of the same steps specified with a recombination terminology as 
shown in Figure 2. 

So, in our model we can say that aggregation is performed in the pattern 
recombination phase, whereas surface generation is equivalent to the gram- 
matical adaptation phase in recombination patterns. Taking into account those 
equivalences, we could define an NLG system in which once the content of 
the text is determined we receive a text plan where: 


1. we can recognize some patterns in the so-called pattern recognition phase; 

2. we can apply pattern recombination by performing a task analogous to 
the one performed by the so-called aggregation technique —as a matter of 
fact, what we do in such a phase is to DELETE redundant elements and, 
therefore, to provide a more readable text (the same task performed by 
aggregation); 

3. in the grammatical adaptation phase we perform a task analogous to sur- 
face generation in natural language generation mechanisms, since what 
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Content Selection/Text Plan 


Y 
Input Proposition 


Yy 
Pattern Recognition 


y. 


Pattern Recombination 


Y 
Recombined Pattern 


Y 
Grammatical Adaptation 


Y. 
Complex Sentence 


Fig. 2. NLG System with Recombination Patterns. 


we do here is to make the necessary modifications in order to provide a 
grammatical sentence (agreement, tense modification, and so on). 


6 Final remarks 


In this paper we have made a preliminary introduction of a new framework 
for linguistics based on the behaviour of DNA molecules. Recombination 
patterns seem to be quite suitable for explaining in a completely new way 
some syntactic phenomena. We claim that they are a powerful and simple 
model that can be very useful to: 


1. Reconstruct syntax with molecular methods. 
2. Formulate some systems capable of generating the larger part of syntactic 
structures of language. 
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3. Define a formalization that can be implemented and may be able to de- 


scribe and predict the behaviour of syntactic structures. 


The considerable number of similarities between aggregation and re- 


combination patterns has led us to suggest the possibility of carrying out 
aggregation by means of recombination. To the question ‘why recombina- 
tion patterns for aggregation processes?’ we can answer by referring to the 
main features of the model introduced here: 


Simplicity: 

— A limited number of patterns are sufficient to explain every syntactic 
phenomenon. 

— Itis possible to model syntax with just five operations. 

- It is based on cutting and pasting mechanisms, that are simpler and 
more efficient than rewriting. 

Naturalness: They reconstruct syntax with natural methods and therefore 

provide a natural model for natural languages. 

Linguistic base: They formulate systems to generate complex syntactic 

structures of natural language. 

Computational Suitability: They define a formalization of language that 

can be implemented. 
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Summary. In this paper we consider DNA and membrane computing, both as the- 
oretical models and as problem solving devices. The basic motivation behind these 
models of natural computing is using parallelism to make hard problems tractable. 
In this paper we analyze the concept of parallelism. We will show that parallelism 
has very different meanings in these models. We introduce the terms ‘or-parallelism’ 
and ’and-parallelism’ for these two basic types of parallelism. 


1 Introduction 


Over the last decade, molecular computing has been a very active field of re- 
search. The great promise of performing computations at a molecular level 
is that the small size of the computational units potentially allows for mas- 
sive parallelism in the computations. Thus, computations that are intractable 
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in sequential modes of computation can be performed (at least in theory) in 
polynomial or even linear time. 

In this paper, we investigate the way parallelism is used in different 
models of molecular computation. We are interested in the role parallelism 
plays in theoretical models (that is in the language-generating devices) as 
well as in the way parallelism is employed to solve computationally hard 
(typically NP-complete) problems. 

We focus on two branches of molecular computing, DNA computing and 
membrane computing. In the future, this work could be extended to other 
models of molecular computation, such as forbidding-enforcing systems, 
as well as other bio-inspired models of computation, like cellular automata 
and neural networks. 

The field of DNA computing was instigated by Leonard Adleman’s 1994 
paper [1], in which he reports a molecular solution of an instance of an 
NP-complete problem. Since then, much work has been done in this area, 
covering both experimental work and the formulation of formal and com- 
putational models. These models typically represent DNA strands as strings 
and model biochemical operations by string rewriting rules. The reader is 
referred to [11] for a detailed overview of the main computational models 
of this type. 

Membrane computing is an area of molecular computing initiated by 
Gheorghe Paun [8, 10]. A membrane system (also called P system) is a com- 
puting model inspired by the structure of a living cell. A membrane struc- 
ture defines regions where objects evolve according to given rules. From 
this basic structure, many different computational devices can be defined, 
depending on the objects used (strings, symbols), the types of rules allowed 
and the way the generated language is defined. Gheorghe Paun’s book [9] 
is a good introduction to the most important types of membrane systems. 

We will first analyze the nature of parallelism (see also [7]) in the formal 
computational models of both areas. Next, we examine the ways parallelism 
is used in both cases to solve computationally hard problems. 

We will try to avoid considering specific models. Instead we focus on the 
nature of the parallelism present in them, which we will see to be common 
to most models in the considered area. 
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2 Parallelism in DNA computing 


The field of DNA computing considers molecular computing in a variety of 
ways, which range from purely theoretical computational models to more 
practical ‘molecular algorithms’ to actual experimental implementations of 
molecular computations. The theoretical models include different types of 
systems such as splicing systems, sticker systems and deletion-insertion sys- 
tems. Details about these systems can be found in [11]. Here we do not 
consider Watson-Crick automata, which is not a parallel device: rather it is 
based on the inherent power of Watson-Crick complementarity. We claim 
that despite the different levels of abstraction and the different models, es- 
sentially the same type of parallelism underlies all systems of DNA com- 
puting. 

In experimental DNA computing, the working assumption is that all 
molecules are present in such huge quantities that they can be considered 
infinite. All formal models considered share this assumption: We start from 
a (generally finite) initial language L, with all words w € L present in arbi- 
trarily many copies. Similarly, in experiments, a series of biochemical opera- 
tions is applied sequentially, but each biochemical operation applied affects 
all molecules present. This is reflected in the theoretical models, where each 
word is rewritten sequentially, but this sequential rewriting is applied to 
all words in parallel. In this way, one computation gives all possible solu- 
tions. In the context of language generating systems, we can say that one 
‘run’ of the system gives the entire generated language. This is an impor- 
tant difference with respect to most known models of computation, such as 
the Turing machine or Chomsky grammars, where one run of the system 
accepts or generates just one word in the language. 

As an example of this type of parallelism we consider a molecular algo- 
rithm presented in [4]. Although this is a theoretical algorithm, experimental 
implementations of this or very similar algorithms are reported in [3], [4] 
and [5]. In addition to exemplifying parallelism, this example also shows 
how this parallelism can be used to reduce the computational complexity of 
NP-complete problems. 

The algorithm we consider solves the satisfiability problem for disjunc- 
tive clauses (SAT). Suppose the Boolean variables of the formula are p, to 
Pn and the number of clauses is m. Suppose, moreover, we have a molecule 
with 2n sites on which we can ‘write’ (i.e. change) the site in such a way 
it can later be recognised or ‘read’ as being written. An unwritten site 
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is interpreted as 1 (true) and a written one as 0 (false). We start with a 
single molecule which encodes 2n 1’s. We interpret this as the values of 


Pi, P11 P2, 7P2,+-Pns 7Pn- 
Now, the algorithm for solving SAT is the following: 


1. For each variable y, divide the solution into two parts. In one part, write 
the site for y. In the other part, write the site for >y. This yields all 
consistent assignments of variables. 

2. For each clause, divide the content of the test tube. If for instance the 
clause is (p; V sp;), we divide it into two parts. In one part, remove all 
molecules which have =p; = 1, in the other, those which have pj = 1. 
Thus only molecules which satisfy this clause remain. 

3. Check if a molecule remains. If so, the answer is ‘yes’, otherwise ‘no’. 


We see that all the solutions are generated and checked in parallel, and in 
this way we can simultaneously explore all options, and solve the instance 
of SAT in O(n + m) biochemical operations (i.e. in linear time), assuming 
each operation takes constant time. 

Fig. 1 shows a sketch of the DNA computing method. In this example 
there are three kinds of original DNA-molecules with sticky ends. Two of 
these kinds can make a new molecule. The result can be seen in the figure: 
long molecules without sticky ends (for instance, the last one). The third 
kind of molecule can also make new molecules. For example, the second 
molecule is the result of their reaction. Technically, by amplifying we can 
multiply the number of present; the assumption is that there are enough 
molecules of each kind in the sup. We assume that all possible ways of 
continuing the computation are present in the same time. 


3 Parallelism in membrane computing 


A Membrane system (also called P system) is a computing model which 
abstracts from the way living cells process chemical compounds in their 
compartmental structure. This amounts to a membrane structure that de- 
fines different regions which evolve according to given rules. The objects 
can be described by symbols or by strings of symbols (in the former case 
their multiplicity matter, (that is, we work with multisets of objects placed 
in the regions of the membrane structure); in the second case we can work 
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Fig. 1. Several DNA molecules of the same type and all possible ways of continuing 
the computations are present 


with languages of strings or, again, with multisets of strings). By using the 
rules in a nondeterministic, maximally parallel manner, one gets transitions 
between the system configurations. A sequence of transitions is a compu- 
tation. With a halting computation we can associate a result, in the form 
of the objects present in a given membrane in the halting configuration, or 
expelled from the system during the computation. Various ways of control- 
ling the transfer of objects from one region to another and of applying the 
rules, as well as (using so-called active membranes:) possibilities to dissolve, 
divide or create membranes were considered. 

Many of these variants lead to computationally universal systems, while 
several variants with enhanced parallelism are able (at least theoretically) to 
solve NP-complete problems in polynomial (often, linear) time, by making 
use of an exponential space. 

Parallelism can be controlled through cooperative rules, catalysts, etc. A 
simple membrane computer with symbol objects can be seen in Fig. 2. 


Let us see how a membrane system can generate languages. First, as 
the most usual case the so-called multiset languages are considered. The 
membrane system in Figure 2 starts with a copy of the objects a and b in 
membrane 1. In any subsequent configuration (except the halting one) there 
is also exactly 1 copy of a and b in membrane 1. If a or b is processed 
by the rule sending symbols into membrane 2 the other symbol should 
be processed by the same type of rule, for maximal parallelism. In these 
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a— a(a,in2) (a,in2) 1 
b — b (6,in2) (b,in2) (b,in2) 
ab — (a,out) (b, out) 


2 a 
b 


Fig. 2. A cooperating membrane system 


parallel steps two copies of a and three copies of b appear in membrane 
2. No rule is available in membrane 2 so all copies inside still remain with 
no change. The process continues until the cooperative rule is used to send 
both a and b out of membrane 1. The computation halts with this step. A 
word of the generated language is in membrane 2 (we consider it as the 
output membrane). This membrane contains objects a2"b>" depending on 
the length of the process. So, the generated multiset language as a Parikh- 
set is (2n,3n). 

As we have seen in the membrane system, parallelism is inside the com- 
putation of a (multiset) word. The result of a computation process is a par- 
ticular word of the language, and it can be any because of non-determinism. 
To generate the whole language, we need to restart the computation many 
times (usually infinitely many times). So, the power of parallelism is used 
inside the computation of a word: and this means that each word can be 
computed quickly and effectively. 

Sometimes the objects sent out of the system build the result word 
(traces). Parallelism has the same role here: the whole system constructs 
only a very particular part (a word or words whit the same Parikh-vector) 
of the language in a run. 

The basic idea of parallelism is the same whenever we consider mem- 
brane systems using catalysts, evolution rules, priorities, cooperative rules, 
symport, antiport, electrical charges, dissolutions, creating and/or dividing 
membranes. The only difference is that by using active membranes one can 
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dynamically play with the structure of the system as well. With more mem- 
branes one can easily organize the derivation process because the mem- 
branes can have various rule sets. The creation and division of membranes 
enables independent computations to be performed in parallel way, too. 

Now let us analyze how membrane systems can be used for effective 
problem solving. We briefly describe how membrane creating can be used 
to solve SAT in linear time. First we have an initial membrane with only 
one object. Applying the only applicable rule for this object we introduce 
two new objects corresponding to the possible values of the first variable, 
and a technical object to continue the process. Then the new objects of the 
logical variable create new membranes (and copy some symbols to the new 
membranes). Now for each new membrane two new objects are introduced 
for the next variable. These new objects create new ones again, etc. Finally 
the membrane structure forms a complete n-level binary tree. Each path 
from the initial to a leaf-membrane represents a possible truth-assignment. 
Now, each membrane in the n-th level computes objects for satisfied clauses 
of the SAT formula. (This can be done easily by a comparing the literals 
of the clauses and the given truth-assignment of the membrane.) Using a 
cooperative rule a special symbol is sent out if all clauses are satisfied in 
a membrane. In the next step the previous level membranes forward these 
symbols. Therefore, this special symbol moves up all n levels, and finally 
leaves the system and terminates the process with answer ‘satisfiable’. More 
technical details can be found in [9]. 

In this process the power of parallelism builds up a complete tree by lev- 
els in linear time. In each membrane, at the deepest level there are rules for 
each clause, so clauses can be evaluated in a parallel way. (Here parallelism 
is used in the same way as in language generation.) 


4 A brief comparison 


Now we describe some essential differences between DNA and membrane 
computing. 

In DNA-computing we assume that there are (theoretically) infinitely 
many copies of each string of the initial language. We also assume that 
infinitely (arbitrarily many) many copies of each string can appear in the 
molecular soup and, therefore, all possibilities are explored. In a mathemat- 
ical model, we can use sets to describe the configuration of the system. The 
assumption is that the ‘space’ is already given. 
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In contrast membrane computing multisets. The numbers of the present 
objects are very important. Membrane computing use ‘active membranes’ 
(division, creation, etc). These allow all options to be explored, space traded 
for time, etc. In a way, an extra level of parallelism is needed for efficiency, 
the parallelism of rule application does not play any significant role. 

In DNA computing we can try all possibilities (maybe in a clever order, 
to reject the false ones as soon as possible) to have a solution. If there is any, 
then a (some) try will be successful. 


5 Notions of parallelism 


Abstracting from the specific systems we can identify two essentially differ- 
ent notions of parallelism. We classify those here. 

In the ’and-parallelism’ the computation needs several branches. These 
branches provide some subresults. Typically, these involve rewriting in par- 
allel way, but there is a dependence between parallel computations. For 
instance, a parallel computation generates one word. It affects both compu- 
tational power and complexity. The non-deterministic massive parallel way 
of applying the rules in the membrane system fits this notion exactly. 

The ‘or-parallelism’ (which we could also call ‘Chinese army paral- 
lelism’) is the following. The parallel branches independently try to solve 
the problem. Any of them can produce the solution. 

Computations are independent and they are performed in parallel. One 
example is parallelism, which generates all words simultaneously. This type 
of parallelism only affects complexity. We assume that the space can be 
considered arbitrarily large. Parallelism in DNA computing, where each 
word is rewritten sequentially, but all words are rewritten in parallel is of 
this type. The use of active membranes in membrane computing also pro- 
vides this kind of parallelism. Note that this notion is closely related to 
non-determinism in the usual sense. This concept of parallelism considers 
all the ways in which a non-deterministic algorithm can run at the same 
time. 


6 Conclusions 


In this paper we have analyzed two fields of natural computing, DNA com- 
puting and membrane computing, and studied the parallelism present in 
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them. Even though the parallelism is the main motivation for these fields 
and an important property of both computational systems and problem 
solving algorithms, it has never really been studied in itself. Looking at 
the way the systems in these areas work in terms of parallelism, we made 
two important observations. Firstly, in spite of the diversity of computa- 
tional systems and even levels of description (experimental implementa- 
tions, problem solving algorithms, language generating devices), we can 
still make general statements about the parallelism present in these fields, 
because the underlying notion of parallelism is essentially the same in all 
models. Secondly, we noted that the notion of parallelism used in DNA com- 
puting is very different from the parallelism present in all membrane sys- 
tems. Whereas systems in DNA computing have parallel independent com- 
putations, in membrane systems the parallelism is applied in a dependent 
way, with all parallel computations taking place inside the rewriting of a sin- 
gle configuration. However, in membrane computing there exists a subclass 
of membrane systems called membrane systems with active membranes, 
which in addition to the parallelism present in all membrane systems al- 
lows for another type of parallelism, which is the same type of parallelism 
as in DNA computing. It is precisely this additional kind of parallelism 
that makes it possible to provide efficient solutions to computationally hard 
problems. 

We have also described the concept of parallelism in computations in 
a more general way. There are two basic notions. In the so-called ’and- 
parallelism’ the results of several parallel branches are needed to provide 
the result of the computation. We use parallelism for computation to be ef- 
fective: parallel branches work somehow on the same ‘state’ or configuration 
of the system. In ‘or-parallelism’ (analogous to the so-called Chinese army 
algorithm) the branches are independent and any of them can produce the 
final result. We use parallelism because we do not know which branch will 
be successful. This concept can be imagined as a non-deterministic machine 
running in all possible ways at the same time. 
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Summary. Tom Head (1987), in his pioneering work on formal language theory ap- 
plied to DNA computing, introduced a new operation of splicing on strings, while 
proposing a model of certain recombination behaviour of DNA molecules under the 
action of restriction enzymes and ligases. Since then this operation has been studied 
in great depth giving rise to a number of theoretical results of great interest in for- 
mal language theory. Extension of this operation of splicing to higher dimensional 
structures such as circular words, arrays, trees and graphs have been proposed in 
the literature. Here we examine the effect of certain specific forms of the splicing 


operation applied to arrays and graphs 
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1 Introduction 


There has been a lot of interest among researchers in the study of formal 
language theory applied to DNA computing. A specific model of DNA 
recombination is the splicing operation which consists of “cutting” DNA 
sequences and then “pasting” the fragments again, under the action of re- 
striction enzymes and ligases. In [7], Tom Head defined splicing systems 
motivated by this behaviour of DNA sequences. The splicing systems make 
use of a new operation, called splicing on strings of symbols. These systems 
are a new Class of generative systems, intended to model certain recom- 
binant behavior of DNA molecules and are of current interest and study. 
Theoretical investigation of splicing on strings has been extensively done by 
different researchers [8]. 

Extension of the splicing operation to graphs has been proposed by Fre- 
und [4]. Relationship between graph splicing languages of Freund [4] and 
Hyperedge replacement graph languages [6] is examined in [3]. Sakakibara 
and Ferretti [13] have introduced and studied splicing of tree structures. 
Krithivasn [9] has considered a different kind of splicing of graphs. 

On the other hand, in syntactic approaches to generation and recog- 
nition of picture patterns, considered as arrays of symbols, several two- 
dimensional grammars have been proposed and studied [ 5, 12, 15, 16, 17, 
18, 19, 20] extending and generalizing the techniques of formal string lan- 
guage theory. Splicing of arrays structures which could be thought of as 
graphs on grid structures has also been recently considered [2,10]. A simple 
but effective method of splicing on images of rectangular arrays is intro- 
duced in [2] as an extension of the operation of splicing on strings. 

Mateescu etal [11] introduced a special kind of a splicing rule, called 
simple splicing rule and investigated the effect of this type of rule on words. 
This notion of simple splicing has been examined in [1] for circular words. 
As an application of the concept of simple splicing, it is natural to extend 
this special form of splicing to higher dimensional structures such as graphs, 
trees and arrays. 

Here we introduce and examine the analogue of simple splicing for pic- 
ture patterns of rectangular arrays in the context of domino splicing rules of 
[2]. We compare the resulting systems called Simple array splicing systems 
with two other picture describing models. We then introduce the notion of 
simple splicing on trees which allows us to obtain a set of trees. The re- 
sulting system, called Simple Tree splicing system, describes the derivation 
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trees of context-free grammars. We also consider the string language con- 
sisting of words associated in a standard way with trees . It is known that 
simple splicing yields only regular languages [11]. It is of interest to note 
that, in contrast to the string case, the picture language class obtained by the 
simple array splicing systems is incomparable with the two other regular- 
like picture classes and in the tree case, the associated string languages are 
context-free languages. 


2 Simple Array Splicing Systems 


We first recall the notion of simple splicing on words [11]. For notions of 
language theory we refer to [14]. 


Definition 1. Let V be an alphabet. $, # are two special symbols, not in V. A simple 
splicing rule over V is a string of the form r = a# A $ a# A, wherea € V. 
For such a rule r and strings x,y,z € V*, we write (x,y) +, z if and only if 
X = xXAX2, Y = Y1aY2, Z = XAY2 for some xX1,X2,Y1,Y2 € V*. We say that z is 
obtained by splicing x,y, as indicated by the rule r. 


A simple splicing scheme is a pair 7 = (V,R), where V is an alphabet 
and R is a set of simple splicing rules. A simple splicing scheme is also 
referred to as a simple H scheme [11]. For a given simple splicing scheme 
o = (V,R) and a language L C V*, we define o(L) = {z € V* / (x,y) Fy z 
for some x,y € L, rE R } 


We now introduce the notions of simple domino splicing rules and sim- 


ple array splicing Systems (SAS). These systems are a special kind of H array 
splicing systems introduced and studied in [2]. 


Definition 2. Let V be an alphabet. #,$ are two special symbols, not in V. A col- 


: a 
umn domino over V is of the form 


A and a row domino is of the form |a|b| for some 
a,b € V. Both a,b can be A or #. 


A simple column or row domino splicing rule over V is of the form r 
= « #2 § 6B # A where both a and 6 are column dominoes or both row 
dominoes.We refer to «, 6 as the first and third dominoes of r respectively. 


; TRIANGLE 6 e December 2011 


122 K.G. Subramanian, A. Roslin, P.H. Chandra 


Given two arrays X and Y of sizes m x p and m x q respectively, 


ay eee Ay; aan Ap 
ary eee 42,j eee 2p 
X=, 
Am Amj °° Amps 
by4 Dig ++ Dag 
boy bo +++ Dag 
Y=, 
Dini ++ Dink bing 


Air, bis © V, forl <i<m,1<r<p,1<s <q. Let X* and Y* be bordered 
arrays of sizes (m+ 2) x (p+2), (m+2) x (q +2) obtained by surrounding 


X and Y with # symbols, as shown below. 


# # 


# ay1 seen 
# Ap] sisiers 


xt 


# Ami °° 


# # 
# # 


obi oes 
B Gajeses 


yi 


| ee 


# # 


# 


Ding # 
# # 


Here we refer to the top and bottom rows of #/s as the 0" and m + 1% 
row. Similarly for the leftmost and rightmost columns of #’s. 


We write (X,Y) 


|2 Z if there is a sequence of simple column splicing 


rules 10,11, 12, +m (not necessarily all different) such that 


Tj => %i,j | # 


A 


$ 


bi A 


ist, 


# 


Dis1k A 


and 4i,j = bik 
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for alli, 0<i< mand for some j,kO<j<p+1,0<k<q+1land 


#ay ++ aij by +s Dig # 


ot # ary +++ aj bop +++ Dog # 


# Ami -** Am, Dink *** bing # 


In other words, we can imagine that a 2 x 1 window is moved down the 
j'” column of X*. The sequence of dominoes collected are the first dominoes 
of the rules 79,11,12,...,'m (not all necessarily different). Likewise for the kth 
column of Y* except that the dominoes collected are the third dominoes of 
the rules. When such rules exist in the system, the simple column splicing 
of the arrays X and Y amounts to the array X* being vertically “cut” after 
j' column and the array Y* after k‘” column and the resulting left subarray 
of X* “pasted” (column catenated) with the right subarray of Y* to yield 
Z*. We now say that Z is obtained from X and Y by simple domino column 
splicing in parallel, where Z is Z* with surrounding # symbols deleted. 

We can similarly define simple row splicing operation of two arrays U 
and V of sizes p x n and q x n, using simple domino row splicing rules to 
yield an array W. We write (U,V) |2  W. As done for the column splic- 
ing of arrays, we can imagine 1 x 2 windows being moved over respective 
rows. The row splicing of the arrays U and V can be thought of as U* being 
horizontally “cut" below the r‘” row and V* below s‘” row and the upper 
subarray of U* “pasted" (row catenated) to the lower subarray of V* to yield 
W*. We now say that W is obtained from U and V by simple domino row 
splicing in parallel, where W is W* with # symbols deleted. 


We now introduce the main notion of Simple Array Splicing Systems. 


Definition 3. A Simple array splicing scheme is a triplet T = (V,Rc,R;) 
where V is an alphabet, R, = a finite set of simple domino column splicing rules, 
and R, = a finite set of simple domino row splicing rules. 

For a given Simple array scheme T = (V,R¢,R,) and a language L C V**, we 
define 
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[(L) ={Z € v* /(x*y*) (2 Zor (X*¥*) |& Z* for some X,Y € 
L}. In other words, T(L) consists of arrays obtained by column or row splicing any 
two arrays of L using the simple domino column or row splicing rules. 


A Simple array splicing system (SAS) isdefinedby 5S = (I,I) 
where I = (V,R.,R,) and I is a finite subset of V**. The language of S is 
defined by L(S) = I*(I) and we call it a Simple array splicing language 
(SASL) and denote the class of such languages by £(S ASL). 


We illustrate with an example. 


Example 1. Let V = {x,a}, | a cna 


nen Ge a Bek 
mes gob 
mle Bs Hae 
nfs Bs Ha 
mee Be dea} 

R= {a :fa] # AP) & [xa] # AP 


as: [A] # AD) $s i # BAY } 
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L is the language consisting of pictures of the form in Figure 1, where 
white square is interpreted as a and black as x. 


Fig. 1. Rectangles of a’s surrounded by x’s 


x. 
Example 2. Let V = {x,.}, T= (x... 9, 
XXX 
# ‘| # | 
Re= {pri} # r $¢ # ey 
aa @ Ta, 
pase ig eR 
A a 
PS *Ty : A $c x| = A 
Ae AI x nN 
Pala A] Sool) * 
R= {o:(E] # AA) $ fx # AP 
gz: |x|.) # [AJA] $, x]. # [AIA 
q3:|.|.| # [AJA] $ [J # [AA 
qa: (#] # A] $s [FB # Al } 


L is the picture language consisting of all m x n arrays describing token 
L of x's. 
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x 

x 

x 

a ary x 
x 

x 

xXxXXXXX 


Fig. 2. Array describing token L Token L of x’s 


Example 3. Let V = {a,b}, I= i: b \ : 


| 


FR 


FE] S 


>) 


a 
i-S) 

sk 
Le] =| 
ar ae 


R= {qi [ab # [AlAl $ [ajo] # [AIA 


gz: |bla| # |A\A| $ |bla| # JA\A 


q3:[#b] # [AJA] $ [Hb] # AIA 


af] # DE) $ fi # pl} 


L is the language consisting of all “chessboards" with even side-length. 
i.e. pictures of the form in 2. 


The picture or pattern of Figure 2 can be represented by an array M, where 
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Fig. 3. Chessboard Pattern 


ababababab 
bababababa 
_ababababab 
~ bababababa 
ababababab 
bababababa 


where ‘a’ stands for black and ‘b’ for white. 

We now compare the generative power of Simple array splicing sys- 
tems with other picture description models. The class LOC [5] of local pic- 
ture array languages described by 2x2 “windows" and the class of two- 
dimensional right-linear languages described by 2RLG [5, 15] are two of the 
basic classes in the hierarchy of picture array describing grammars. 


Theorem 1. The classes LOC [5] of local array languages and £(SASL) of Simple 
array splicing languages are incomparable but not disjoint. 


Proof: The picture language M consisting of all m x n arrays (m > 2,n > 2) 
describing token L of 1’s is in LOC [2]. It is also described by a Simple array 
splicing system S = (V, R-,R;,1) as in example 2. 


It is known [5] that the picture language L of all rectangular arrays over 
V = {a} with 3 columns is not in LOC. In fact, it is not possible to fix the 
number of columns only one symbol, as the block | in can be moved 


without restriction on the columns. But it is generated by a Simple array 
splicing system where 
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aaa 
Il=<aaah, 
aaa 


Ry = {a1 : (ala # fAlAl $ (aa) # [AA 


qo: |#]a| # AIA $ |#lal # AIA 


as :(aif] # AD) $ fe] # DP } 


and R,=@ 


It is known [5] that the picture language of square images in which di- 
agonal positions carry symbol 1 but the remaining positions carry symbol 
0 is in LOC. But it is not in SAS. Since row and column splicing are inde- 
pendently done, it is clear that arrays with a proportion between rows and 
columns and in particular pictures with only square size cannot be gener- 
ated by any SAS. 


Theorem 2. The class £(S ASL) of Simple array splicing languages and £L(2RLG) 
[5,15] of picture languages generated by two dimensional right linear grammars are 
incomparable but not disjoint. 


Proof: The picture language of "chessboards" with even side-length (Figure 
3) is also generated by a SAS (Example 2.3) and is known to be generated 
by a 2RLG [5,15]. 

The picture language L consisting of arrays describing token H (Figure 
4)cannot be generated by any 2RLG, as the horizontal row of x’s cannot be 
maintained by any 2RLG [5,15]. But the language consisting of picture ar- 
rays describing token H with three rows and any number of columns can 
be generated by the following SAS : 


xX. . Xx 
Let V = {x,.}, IT=<xxxx), 


, A : 
Re= {pu # $ # 


pP3: # 4 $ # 
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Xe we 2 eX 
> ar ae, 
xXXXXXX 
x x 
x x 
x x 


1, Ae Ha A 
ae a x 
El o Ble Bec 
Poi * Ty) 8 ig FA 
U 


The picture language Lz = {((ab)P 


x x 
x x 
x x 


\ and Rr =@ 


(ba)1)m / p,q,m > 1} cannot be de- 


scribed by any SAS. This is due to the fact that the column splicing of any 
two arrays ((ab)?) and ((ba)1)m will yield an array which is not in Lo. But 
it is generated by the following 2RLG. 


X= {a,b}; 
Ly = {A1, Az, A3, Aa}; 
Vi, = {S,X}; 


Ry = {S 4 AyA2X; S > A3A4Y; X 4 A A2X; Y @ Ag AgY; 


X — Ay Aa; Y > AgAa}; 


Vo = {A1, Az, A3, Aa}; 


R, = {Ay — aA; Ay > a; A2 > bA»; Ad —-> b; A3 —> bA3; A3 > D; 


Ay — aA4; Ay > a} 


Remark 1. We have the following relationship among SAS,LOC,2RLG as 


seen from Theorems 1 and 2. 


Note that the picture language of “chessboards" with even side-length is 
generated by a SAS (Example 2.3) and is known to be generated by a 2RLG 
[5]. It is also a local language.Thus it is a picture language in all the three 


classes. 
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SAS 


Fig. 5. Relationship among SAS, LOC,2RLG 


In fact the set T of 2 x 2 windows describing this language is: 


##H#H#HDAHHHHH 
#a,b#,##,##,ab,ba 


#batabbaabba 
#a,b#,##,##,ba,ab 


1000#ab# 
11,11,#b,a# 


3 Simple Tree Splicing Systems 


We now introduce the notion of Simple splicing on trees as an application of 
the simple splicing rules introduced in [11]. The splicing of trees considered 
here is a special form of a general notion of splicing of trees considered in 
[13]. 

We consider labelled rooted trees T which are connected cycle-free 
graphs with a designated node r called the root of the tree and a label I(v) 
for every node v. Since there is a unique simple path from the root r to any 
other node v in T, this determines a direction to the edges of the tree and 
thus tree is viewed as a directed graph with a precedence relationship such 
that every node has zero or more descendant nodes. A node with a zero 
descendant is called a leaf and any other node is called an interior node. 
A subtree of a given tree T is also a labelled rooted tree with its root as an 
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interior vertex of tree. 


It is usual to denote a tree T with root label a and subtrees 7},..., T, with 
root labels by,...,b; in linear form as given below T = a(T},..., Tn). We call a 
labelled rooted tree T with labels in a set V, simply as a tree T over V. The 
yield of a tree T is a string obtained by reading the labels of the leaves from 
left to right. 


Example 4. Let V = (a,b,c). let T be a tree T = c(a,c(a,c(a,c,b),b),b). The 
yield of tree T is a°cb°. 


We now introduce the notion of simple splicing of two trees, by consid- 
ering simple splicing rules used in the splicing of words[11]. 


Definition 4. Let V be an alphabet. Let r be a simple splicing ruler = (cHA$c#A) 
where c € V. Let T; and Tz be two trees such that c is a root label of a subtree ie of 
T, and a subtree iv of Tr. We say that a tree T3 is obtained by simple tree splicing 

of T; with To, if T3 is the tree obtained from T, removing the subtree i with root 

v having label c and attaching es at the node v. 


A simple tree splicing system S = (V,A,R) where Vis an alphabet, A 
is a finite set of trees over V and R is a finite set of simple tree splicing 
rules. The tree language T(S) consists of all trees obtained from the trees in 
A by repeatedly applying the tree splicing rules of R. The string language 
associated with the system S is the set L(S) of words which are the yields 
of the trees in T(S). 


Example 5. Let V = (a,b,c). let T be a tree T = c(a,c,b). Let S = (V,T,r) 
where r = c#A$c#A. On splicing T with itself we obtain a tree T = 
c(a,c(a,c,b),b). Repeatedly using the splicing operation, we obtain a set of 
trees c(a,c,b),c(a,c(a,c,b),b),c(a,c(a,c(a,c,b)b)b),... which constitutes the 
tree language T(S). The string language of S is L(S) = a"cb":n > 1. 


We exhibit the relation between the set of derivation trees of a context 
free grammar and the tree language of a simple tree splicing system. 
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Theorem 3. Given a context free language G = (Vy, Vr, P,S), there exists a sim- 
ple tree splicing system S such that the tree language T(S) is exactly the set of 
derivation trees of G. As a consequence the context free language L(G) generated 
by G is simply the string language of L(S) 


Proof : Assume that the given CFG is in Chomsky Normal Form with rules 
of the form A — BC or A > a where A,B,C are non-terminals and a is a 
terminal. For each rule A — BC we associate a tree A(B,C). For each rule 
A — a we associate a tree A(a). A corresponding simple tree splicing sys- 
tem S is constructed as follows : S = (Vy U Vr, A’,R) where A’ consists of 
trees associated with the rules of P. R consists of simple splicing rules of 
the form X#A$X#A whenever X is the left hand side of a rule in G and is 
a symbol in the right hand side of a rule in G. It is straight forward to see 
that T(S) consists of exactly the derivation trees of G. 


Remark 2. It is known that the string language of a simple H system [11] is 
regular. It is of interest to note from the theorem 3.1 that the string language 
of a simple tree splicing system is context free and thus the generative power 
of the simple splicing rules is increased when they are applied on the tree 
structures. 


4 Conclusion 

In this paper a theoretical study of a special class of H Array Splicing Sys- 
tems [2] with “Simple” array splicing rules, is made. Although the rules are 
“simple”, the generating power of the picture generating SAS or tree gen- 
erating STS is reasonably higher. It remains to examine other properties of 
these systems such as characterizations etc. as in the string case. 
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